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INTRODUCTION 


This  program  seeks  to  obtain  a  fundamental  understanding  of  the  chemical 
mechanisms  by  which  enzymes  repair  damaged  DNA,  and  to  use  this  information  to 
design  small  molecule  inhibitors  of  these  enzymes.  The  driving  force  for  these  efforts  is 
the  recognition  that  the  effectiveness  of  cancer  chemotherapy  regimes  is  intimately 
connected  to,  and  in  some  cases  directly  relies  on,  DNA  damage  repair  pathways.  A 
more  sophisticated  understanding  of  the  roles  of  DNA  damage  repair  in  the 
pharmacology  of  DNA  replication  inhibitors  will  allow  for  the  design  of  better  treatments 
against  breast  and  other  cancers. 


One  particularly  important 
chemotherapeutic  agent  whose  mode  of 
action  has  been  proposed  to  be 
intimately  linked  with  DNA  damage 
repair  pathways  is  5-fluorouracil  (5-FU) 
and  its  congeners,  which  are  used  to 
treat  breast  cancer,  as  well  as  cancers  of 
the  colon,  head  and  neck  ( 1 ,  2).  It  is 
well  known  that  5-FU  targets  the 
biosynthetic  enzyme  thymidylate 
synthase  (TS),  which  generates 
thymidine  monophosphate  from 
deoxyuridine  monophosphate  (dUMP) 

(3).  However,  the  mechanism(s)  of  how 
thymidine  depletion  results  in 
cytotoxicity  (“thymineless  death”)  has 

been  unclear.  One  proposed  model  (Figure  1)  suggests 
that  as  a  result  of  TS  inhibition,  cellular  dUMP  levels 
begin  to  rise  and  is  efficiently  converted  to  deoxyuridine 
triphosphate  (dUTP).  Most  DNA  polymerases  cannot 
discriminate  between  dUTP  and  TTP,  and  thus  the  high 
dUTP  and  low  TTP  levels  results  in  the  misincorporation 
of  uracil  opposite  adenine  in  replicating  DNA.  While 
these  uracils  are  not  mutagenic  per  se,  they  are 
nonetheless  removed  by  the  uracil  base  excision  repair 
(UBER)  pathway.  The  high  density  of  uracil  in  DNA, 
combined  with  the  low  concentration  of  TTP,  lead  to 
futile  repair  cycles  which  would  eventually  result  in  cell 
death  either  through  DNA  fragmentation  and/or  the  accumulation  of  toxic  repair 
intermediates. 

In  order  to  begin  to  explore  the  relationship  between  the  cytotoxic  mechanism  of 
5-FU  and  uracil  base  excision  repair,  we  initiated  a  research  program  to  discover 
inhibitors  to  the  initial  enzyme  in  the  UBER  pathway,  uracil  DNA  glycosylase  (UDG  - 
Tasks  1  and  2  of  the  Approved  Statement  of  Work).  Once  UDG  inhibitors  were 


Figure  2:  Base  flipping  of  a  uracil 
nucleotide  by  UDG  (see  text).  The 

extrahelical  uracil  nucleotide  is 
highlighted  in  gold  (pdb  code  lemh). 


dUTPase 


DNA 


dUMP  - ►TMP 

|  Thymidylate  1 

T  Synthase  t 
dUDP  -i-  TDP 

1 

1 
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dUTP 

5-fluoro- 

TTP 

3  \ 

U-DNA 

dUMP 

1  1 

DNA 

DNA 
Polymerase 


Cell  Death 

Figure  1:  Uracil  DNA  Incorporation  Model  of  5-FU 
Toxicity.  dUMP  is  converted  to  TMP  by  the  action  of 
thymidylate  synthase.  TMP  is  phosphorylated  twice  by 
cellular  kinases  to  yield  TTP,  which  is  incorporated  into  a 
growing  DNA  chain  by  DNA  polymerase.  If  thymidylate 
synthase  is  inhibited  by  the  suicide  inhibitor  5-fluoro-dUMP, 
dUMP  levels  will  rise  and  cellular  kinases  will  convert  it  to 
dUTP.  Eventually,  the  dUTP  concentration  will  be  high 
enough  to  overwhelm  the  activity  of  dUTPase  (which  under 
basal  conditions  keeps  dUTP  very  low)  and  will  be 
incorporated  into  DNA  by  DNA  polymerase.  The  resulting 
uracilated  DNA  will  be  prone  to  damage  from  abortive  uracil 
base  excision  repair  (UBER). 
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identified,  their  effects  on  5-FU  cytotoxicity  and  potency  in  cell  culture  would  be 
evaluated  (Task  3  of  the  Approved  Statement  of  Work). 

BODY 


Oligonucleotide-based  UDG  Inhibitors:  UDG,  like  other  DNA  glycosylases, 
couples  substrate  recognition  and  enzyme  catalysis  through  base  flipping,  a  process 
where  the  cognate  damaged  base  is  rotated  -180°  from  inside  of  the  DNA  double  helix 
and  into  the  enzyme  active  site.  In  the  case  of  UDG,  only  deoxyuridine  can  attain  this 
fully  flipped  confonnation  due  to  the  tight  induced  fit  and  high  complementarity  of  uracil 
within  the  enzyme  active  site  (Figure  2).  Thus,  short  oligonucleotides  containing  a  non¬ 
hydro  lyzable  deoxyuridine  analog  (|3-2’-fluorodeoxyuridine;  UF)  are  specific,  sub¬ 
micromolar  UDG  inhibitors. 

In  an  effort  to  both  improve  potency  and 
develop  a  set  of  mechanistic  tools  for  the  study  of 
enzymatic  base  flipping,  a  series  of  U  -containing 
double  stranded  oligonucleotides  were 
synthesized  where  the  number  of  the  hydrogen 
bonds  in  the  UF  base  pair  was  systematically 
altered  (Figure  3).  During  the  course  of  extruding 
the  target  base  from  within  the  DNA  duplex  to  the 
active  site,  the  uracil  base  pair  must  be  broken. 

The  energetic  penalty  for  disrupting  the  uracil 
base  pair  is  paid  for  by  the  intrinsic  binding 
energy  of  the  uracil-DNA-UDG  complex  (i.e.  the 
sum  of  all  of  the  exergonic  interactions).  We 
proposed  that  the  ablation  of  hydrogen  bonds  in 
the  uracil  base  pair  would  lead  to  increased 
affinity  of  those  oligonucleotides  with  UDG, 
since  less  binding  energy  would  be  expended  by 
the  system  to  flip  the  uracil  out  of  the  DNA 
duplex. 


As  seen  in  Figure  4,  the  removal  of  three 
hydrogen  bonds  from  the  uracil  base  pair  (UF-D 
vs.  Uf-M)  results  in  a  43,000-fold  increase  in 
binding  affinity  (Kn  =  435  nM  and  0.01  nM, 
respectively).  A  detailed  mechanistic  analysis 
using  transient  kinetic  approaches  and  linear-free 
energy  relationships  suggested  that  the  origins  of 
the  enhanced  affinity  of  UDG  for  UF-M  was  due 
not  only  to  removing  the  energetic  penalty  for 
breaking  the  base  pair  hydrogen  bonds,  but  also 
largely  due  to  the  enhanced  local  flexibility  of  the  DNA.  This  increase  in  conformational 
freedom  allows  for  the  DNA  to  more  easily  adopt  the  bent  conformation  observed  in  the 


5 ' -GCG  GCC  AAZ  UrAA  AAA  GCG  C 
3  ’  -CGC  CGG  TTT  X  TT  TTT  CGC  G 

X  -  D,A,G,N,M 
2  -  P,  A 


H 


Figure  3:  Oligonucleotide-based  UDG 
Inhibitors.  Abbreviations:  M,  4-methylindole 
nucleotide;  D,  diaminopurine  nucleotide;  N, 
purine  nucleotide;  U,  2’-p-fluoro-2'- 
deoxyuridine  nucleotide. 
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fully  flipped  out  complex  (Figure  2).  These  results  were  published  as  two  articles  in 
Biochemistry  (see  Reportable  Outcomes). 

Small  Moleucle  UDG  Inhibitors:  While  the  potency  of  the  oligonucleotide-based 
inhibitors  was  vastly  increased  by  a  relatively  small  chemical  manipulation,  they  were 
still  less  than  ideal  tool  molecules  for  cellular  or  in  vivo  studies  due  to  their  poor  drug¬ 
like  properties.  Thus,  we  initiated  a  small  molecule  UDG  inhibitor  discovery  effort. 
Initially,  a  high-throughput  screening  assay  was  developed,  and  a  commercial  10,000 
compound  library  was  screened  against  human  UDG.  No  authentic  human  UDG 
inhibitors  were  confirmed  upon  follow-up  of  the  screening  hits. 

In  order  to  enhance  the 
probability  of  inhibitor 
discovery,  we  developed  a 
novel  chemical  library  focused 
against  UDG.  The  bias  of  this 
library  was  introduced  by 
taking  advantage  of  the 
specific  interactions  the 
enzyme  makes  with 
extrahelical  uracil  in  the  active 
site.  UDG  only  binds  uracil 
with  modest  affinity  (Kn  ~  80 
uM),  but  uracil  is  highly 
complementary  to  the  active 
site.  Thus,  we  used  uracil  as 
an  anchor  into  the  active  site  of 

UDG,  and  then  relied  on  random  exploration  of 
adventitious  binding  sites  proximal  to  the  active 
site  by  tethering  random  small  molecules  to  one 
of  three  fonnyluracils  (Figure  5). 

The  oxime  chemistry  used  in  the  library 
construction  is  ideal  for  use  in  high-throughput 
screening  (HTS)  because  the  reactions  are 
quantitative,  require  no  purifications,  and  the 
resultant  mixtures  can  be  used  directly  in  the  HTS 
assay.  In  addition,  the  ability  to  easily  vary  the 
linker  length  between  the  formyluracil  and  the 
variable  aldehyde  allows  for  increased  chemical 
diversity  in  the  library  and  rapid  optimization  of 
the  spacing  between  the  uracil  and  the  variable 
binding  element 

Using  these  libraries,  an  initial  hit 
containing  a  catechol  moiety,  3-(3)-13,  was 


A. 

BrvrBr  ■ 


'V^N-P 


H2N'°WaNH2 


n 

=  2-6 

^9 

9^n*9 

k 

25%  U 

R  25% 

Figure  5:  High-throughput  synthesis  of  directed  uracil  library  for  human 
UDG  inhibition  using  oxime  chemistry.  (A)  Synthesis  of  alkyldioxamine 
tethers.  (B)  Mixed  oxime  formation  involves  the  reaction  of  an  alkyldioxiamine 
of  variable  length  with  an  uracil  aldehyde  and  a  R-CHO  library.  The  reaction 
proceeds  quantitatively  to  produce  a  statistical  mixture  of  oxime  derivatives  (50% 
U-R  heterosubstituted  tethers).  The  O-alkyl  oxime  libraries  are  rapidly  and 
efficiently  synthesized  in  96-well  plates  without  purification. 
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identified  as  an  inhibitor  of  human  UDG  (Scheme  1).  The  affinity  of  this  compound  ( K\ 

=  1.6  uM)  for  human  UDG  was  50-fold  greater  than  uracil.  Several  analogs  of  3-(3)-13 
were  made,  which  led  to  the  discovery  of  an  analog,  3-(3)-27  (Scheme  1),  with  a  5-6-fold 
increase  in  potency  (K\  =  0.3  uM).  The  chemical  instability  of  these  catechol-containing 
inhibitors  toward  oxidation  limited  their  usefulness  beyond  in  vitro  characterization. 
However,  an  analog  that  replaced  the  vicinal  hydroxyl  groups  with  a  carboxylic  acid  (3- 
(2)-A8,  Scheme  1)  was  nearly  as  potent  as  the  initial  catechol  hit  (K\  =  3.2  uM)  and  is 
chemically  stable.  An  x-ray  co-crystal  structure  of  3-(2)-A8  and  human  UDG  was  solved 
at  1.3  A  resolution,  and  revealed  that  the  inhibitor  mimics  the  shape  and  interactions  of 
damaged  DNA  with  the  enzyme  (Figure  6).  Currently,  we  are  using  this  co-crystal 
structure  as  a  guide  to  design  more  potent  analogs  of  3-(2)-A8  that  will  be  useful  in  cell- 
based  studies  (see  below).  These  studies  resulted  in  three  publications  (see  Reportable 
Outcomes). 

The  current  small  molecule  inhibitors  of  human  UDG  were  insufficiently  potent 
and/or  bioavailable  to  significantly  affect  cellular  UDG  activity.  In  the  absence  of 
suitable  tool  compounds,  the 
interactions  of  uracil  base  excision 
repair  and  5-fluorouracil 
chemotherapy  were  elucidated  in  a 
yeast  model  system  (Seiple,  et  al. 

Nucleic  Acids  Res.  2006,  34,  140-51). 

These  studies  strongly  suggested  that 
UDG  potentiates  the  cytotoxic  effects 
of  5-fluorouracil  by  initiating  futile 
repair  of  the  high  concentrations  of 
deoxyuridine  in  the  DNA  introduced 
as  a  consequence  of  thymidylate 
synthase  inhibition.  Consistent  with 
this  model  were  the  observations  that 
UDG  knockout  yeast  were  resistant  to 
the  effects  of  5-FU,  while  yeast  that 
were  deficient  in  AP  endonuclease  activities  were  hypersensitive  to  5-FU  (i.e.  a  high 
density  of  uracil  in  DNA  is  not  toxic  per  se,  but  the  attempted  repair  of  these  lesions 
triggers  cell  death).  Studies  that  extend  these  observations  from  the  yeast  model  system 
to  mammalian  cells  are  ongoing. 

KEY  RESEARCH  ACCOMPLISHMENTS 

•  Developed  and  characterized  highly  potent  oligonucleotide-based  inhibitors  of 
uracil  DNA  glycosylase  (SOW,  Task  1  and  2). 

•  Developed  novel  chemical  libraries  based  focused  against  DNA  repair  enzymes 
(SOW,  Task  1). 

•  Identified  potent  and  selective  inhibitors  of  human  uracil  DNA  glycosylase 
(SOW,  Task  1  and  2). 


Figure  6.  Close-up  of  3-(2)-A8  bound  in  the  active  site  of  human 
UDG  from  1.3  A  co-crystal  structure  (pdb  code  2hxm). 
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•  Solved  x-ray  co-crystal  structure  of  3-(2)-A8  and  human  uracil  DNA  glycosylase 
at  1.3  A  resolution. 

REPORTABLE  OUTCOMES 

This  work  has  resulted  in  five  published  manuscripts: 

1.  Krosky,  DJ,  Schwarz,  FP,  Stivers,  JT  Linear  Free  Energy  Correlations  for  Enzymatic 
Base  Flipping:  How  Do  Damaged  Base  Pairs  Facilitate  Specific  Recognition? 
Biochemistry  2004,  43,  4188-4195. 

2.  Krosky  DJ,  Song,  F,  Stivers,  JT  The  Origins  of  High- Affinity  Enzyme  Binding  to  An 
Extrahelical  DNA  Base.  Biochemistry  2005,  44,  5949-5959. 

3.  Jiang  YL,  Krosky  DJ,  Seiple  LM,  Stivers  JT.  Uracil-directed  Ligand  Tethering:  An 
Efficient  Strategy  for  Uracil  DNA  Glycosylase  (UNG)  Inhibitor  Development.  J  Am 
Chem  Soc  2005, 127,  17412-20. 

4.  Jiang  YL,  Chung  S,  Krosky  DJ,  Stivers  JT.  Synthesis  and  high-throughput  evaluation 
of  triskelion  uracil  libraries  for  inhibition  of  human  dUTPase  and  UNG2.  Bioorg  Med 
Chem  2006,  14,  5666-72. 

5.  Krosky,  DJ,  Bianchet,  MA,  Seiple,  L,  Chung,  S,  Amzel,  LM,  Stivers,  JT  Mimicking 
Damaged  DNA  with  a  Small  Molecule  Inhibitor  of  Human  UNG2.  Nucleic  Acids  Res 
2006,  34,  5872-9. 

CONCLUSIONS 

The  long-term  goal  of  this  research  is  to  increase  the  effectiveness  of  5- 
fluorouracil  chemotherapy  through  selective  targeting  of  base  excision  repair  enzymes. 

A  first  step  to  achieving  this  end  is  to  gain  a  better  understanding  of  its  mechanism  of 
action  through  the  development  of  specific  inhibitors  of  UBER  enzymes  to  use  as  tool 
compounds.  We  have  progressed  towards  this  goal  by  developing  methodology  to  create 
chemical  libraries  focused  against  DNA  repair  enzymes,  and  we  have  successfully  used 
this  technology  to  discover  the  first  non-nucleotide  inhibitors  of  human  UDG.  At  least 
one  of  these  inhibitors  shows  evidence  of  being  cell  permeable,  and  will  be  used  as  the 
basis  to  design  more  potent  molecules  to  elucidate  the  mechanism  of  action  of  5- 
fluorouracil  against  breast  cancer  cells. 
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abstract:  To  efficiently  maintain  their  genomic  integrity,  DNA  repair  glycosylases  must  exhibit  high 
catalytic  specificity  for  their  cognate  damaged  bases  using  an  extrahelical  recognition  mechanism.  One 
possible  contribution  to  specificity  is  the  weak  base  pairing  and  inherent  instability  of  damaged  sites 
which  may  lead  to  increased  extrahelicity  of  the  damaged  base  and  enhanced  recognition  of  these  sites. 
This  model  predicts  that  the  binding  affinity  of  the  enzyme  should  increase  as  the  thermodynamic  stability 
of  the  lesion  base  pair  decreases,  because  less  work  is  required  to  extrude  the  base  into  its  active  site.  We 
have  tested  this  hypothesis  with  uracil  DNA  glycosylase  (UDG)  by  constructing  a  series  of  DNA  duplexes 
containing  a  single  uracil  (U)  opposite  a  variety  of  bases  (X)  that  formed  from  zero  to  three  hydrogen 
bonds  with  U.  Linear  free  energy  (LFE)  relationships  were  observed  that  correlated  UDG  binding  affinity 
with  the  entropy  and  enthalpy  of  duplex  melting,  and  the  dynamic  accessibility  of  the  damaged  site  to 
chemical  oxidation.  These  LFEs  indicate  that  the  increased  conformational  freedom  of  the  damaged  site 
brought  about  by  enthalpic  destabilization  of  the  base  pair  promotes  the  formation  of  extrahelical  states 
that  enhance  specific  recognition  by  as  much  as  3000-fold.  However,  given  the  small  stability  differences 
between  normal  base  pairs  and  U*A  or  U*G  base  pairs,  relative  base  pair  stability  contributes  little  to  the 
>  106-fold  discrimination  of  UDG  for  uracil  sites  in  cellular  DNA.  In  contrast,  the  intrinsic  instability  of 
other  more  egregious  DNA  lesions  may  contribute  significantly  to  the  specificity  of  other  DNA  repair 
enzymes  that  bind  to  extrahelical  bases. 


The  genetic  information  of  a  cell  can  be  irreversibly  altered 
through  the  chemical  modification  of  nucleotide  bases  (7). 
To  combat  these  mutagenic  effects,  organisms  have  evolved 
a  two-tiered  base  excision  repair  (BER)  pathway  that  handles 
a  wide  array  of  base  lesions  (2).  In  the  first  stage,  a  highly 
specific  DNA  repair  glycosylase  excises  the  damaged  base 
from  the  DNA  (J,  4),  producing  an  abasic  site.  This 
intermediate  is  then  processed  by  the  sequential  action  of 
several  repair  enzymes  that  ultimately  restore  the  site  to  its 
original  state  (2).  Since  damaged  site  specificity  resides  solely 
with  the  DNA  glycosylase,  these  enzymes  must  possess 
extraordinarily  high  catalytic  specificities  (4,  5).  In  the 
absence  of  such  specificity,  undamaged  bases  would  be 
randomly  excised  from  DNA,  leading  to  undesirable  abasic 
sites  and  genetic  instability  (6—10). 

In  general,  enzymatic  specificity  results  from  the  extra¬ 
ordinary  structure  of  enzyme  active  sites  that  disfavors 
formation  of  catalytically  productive  interactions  with  non¬ 
substrate  molecules,  and  strongly  favors  such  interactions 
with  the  true  substrates.  As  an  essential  part  of  their 
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recognition  mechanisms,  all  DNA  glycosylases  extrude  their 
damaged  bases  from  the  DNA  double  helix  in  a  process 
known  as  base  flipping,  thereby  placing  it  extrahelically  into 
their  active  sites  where  specific  interactions  with  the  damaged 
base  can  be  formed  (77,  72).  As  part  of  the  energetic  cost  of 
base  flipping,  the  hydrogen  bonds  and  stacking  interactions 
of  the  base  pair  must  be  disrupted.  Thus,  a  prediction  is  that 
DNA  glycosylases  should  bind  more  tightly  to  damaged  sites 
with  disrupted  base  pairing  because  it  requires  less  binding 
energy  to  flip  the  damaged  base  from  the  destabilized  site 
(4,  13—22).  Such  a  thermodynamic  mechanism  is  quite 
general,  and  would  apply  even  for  DNA  glycosylases  that 
interact  with  the  base  that  opposes  the  damaged  base  (23— 
25). 

How  much  does  the  intrinsic  thermodynamic  stability  of 
the  damaged  base  pair  contribute  to  specific  damaged  site 
binding  by  DNA  glycosylases?  We  have  investigated  this 
question  using  the  enzyme  uracil  DNA  glycosylase  (UDG), 1 
which  removes  uracil  from  U'G  and  (J- A  base  pairs  in  duplex 
DNA  (14,  26,  27).  The  approach  was  to  measure  the  binding 
affinity  of  UDG  for  a  series  of  DNA  duplexes,  in  which  the 
number  of  hydrogen  bonds  (n)  between  uracil  and  its 
opposing  base  (X)  were  systematically  varied  (Figure  1A). 


1  Abbreviations:  UDG,  uracil  DNA  glycosylase;  UF,  2'-/3-fluoro-2'- 

deoxyuridine;  <j>,  abasic  site;  D,  2,6-diaminopurine;  M,  4-methylindole; 

N,  nebularine;  DSC,  differential  scanning  calorimetry;  ip,  pseudo¬ 

dihedral  angle. 
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Figure  1:  (A)  Specific  binding  of  UDG  (E)  to  a  DNA  duplex  containing  deoxyuridine  (U)  opposite  a  purine  analogue  (X).  In  this  study, 
the  number  of  hydrogen  bonds  ( n )  in  the  U*X„  base  pair  varies  from  zero  to  three.  Since  binding  of  uracil  requires  breaking  of  the  base  pair 
hydrogen  bonds,  the  difference  in  binding  free  energy  (AAGbi„d)  between  a  DNA  duplex  with  n  hydrogen  bonds  and  a  reference  duplex 
with  three  hydrogen  bonds  (U‘X3)  should  in  part  reflect  the  reduced  energetic  cost  of  breaking  the  hydrogen  bonds  in  the  base  pair.  (B) 
Dissociation  of  a  duplex  with  a  U‘X„  base  pair  into  two  single  strands.  The  difference  in  the  duplex  melting  free  energy  (AAGmeit)  between 
a  U'X„  duplex  and  the  reference  duplex  (U‘X3)  will  also  reflect  the  energetic  effects  of  a  destabilized  base  pair. 


Then,  these  binding  affinities  were  correlated  with  rigorous 
measurements  of  the  thermodynamic  stabilities  of  these 
duplexes  (Figure  IB).  UDG  was  selected  for  this  study 
because,  unlike  many  DNA  glycosylases  (23—25),  it  does 
not  make  any  direct  contacts  with  the  base  opposite  uracil 
(11).  Thus,  the  observed  changes  in  binding  affinity  can  be 
largely  attributed  to  the  relative  stability  of  the  U*X„  base 
pairs  in  the  free  duplex  DNA  and  not  differential  interactions 
of  the  enzyme  with  the  various  opposing  bases.  These 
quantitative  free  energy  correlations  provide  the  first  direct 
evidence  that  promotion  of  extrahelical  conformations  by 
enthalpic  destabilization  of  a  damaged  site  can  indeed 
enhance  the  specific  binding  of  a  DNA  repair  enzyme. 

EXPERIMENTAL  PROCEDURES2 

Materials.  The  2/-deoxynucleoside  phosphoramidites,  CPG 
supports,  and  DNA  synthesis  reagents  were  purchased  from 
Glen  Research  (Sterling,  VA),  except  for  2'-/?-fluoro-2'- 
deoxyuridine  (UF),  which  was  synthesized  as  described 
previously  (5,  28).  The  oligonucleotides  were  synthesized 
using  standard  phosphoramidite  chemistry  on  an  Applied 
Biosystems  392  synthesizer.  The  oligonucleotides  were 
purified  by  anion  exchange  HPLC  (Zorbax),  followed  by 
C-18  reversed  phase  HPLC  (Phenomenex  Aqua  column). 
Fractions  containing  pure  oligonucleotide  were  concentrated 
to  dryness  in  vacuo,  redissolved  in  MilliQ  water,  and  stored 
at  —20  °C.  The  purity  of  the  oligonucleotides  was  assessed 
by  matrix-assisted  laser  desorption  mass  spectroscopy  and 
denaturing  polyacrylamide  gel  electrophoresis.  The  concen¬ 
tration  of  each  oligonucleotide  was  determined  using  its 
extinction  coefficient  at  260  nm  (29).  DNA  duplexes  were 
hybridized  in  10  mM  Tris-HCl  (pH  8.0)  and  25  mM  NaCl 
as  described  previously  (5).  The  purification  of  Escherichia 
coli  UDG  has  been  described  previously  (30). 

Kd  Measurements.  The  Kn  values  for  binding  of  the  UF- 
X„  duplexes  to  UDG  were  measured  essentially  as  described 
using  a  kinetic  competitive  inhibition  HPLC  assay  under 
conditions  where  the  apparent  K\  is  equal  to  the  Kq  value 
(i.e.,  [S]  Km,  where  S  exhibits  rapid  equilibrium  binding) 
(31).  The  only  modification  was  that  the  abasic  product 
(A<FAp)  and  reactant  (AUAp)  were  separated  using  isocratic 


2  Certain  commercial  materials,  instruments,  and  equipment  are 
identified  herein  to  specify  the  experimental  procedure  as  completely 
as  possible.  In  no  case  does  such  identification  imply  a  recommendation 
or  endorsement  by  the  National  Institute  of  Standards  and  Technology, 
nor  does  it  imply  that  the  material,  instruments,  or  equipment  identified 
is  necessarily  the  best  available  for  the  purpose. 


conditions  with  9.5%  CH3CN  and  0.1  M  triethylammonium 
acetate.  Reaction  mixtures  (35  /<L)  containing  10  mM  Tris- 
HCl  (pH  8.0),  60  mM  NaCl,  12.5  /<g/mL  BSA,  1  /rM  AUAp, 
0.5  nM  UDG,  and  a  variable  amount  of  the  UF-X„  duplex 
were  incubated  at  room  temperature  for  5  min.  The  K\>  for 
each  duplex  was  determined  by  fitting  to  eq  1 

kjk0  =  1/(1  +  [UF-X„]/XD)  (1) 

where  k\  is  the  inhibited  rate  and  ka  is  the  rate  in  the  absence 
of  competitor  DNA.  For  the  tightest  binding  duplex  (UF* 
M),  eq  1  was  modified  to  take  into  account  inhibitor  depletion 
(32). 

The  differences  in  binding  free  energies  relative  to  the 
duplex  with  three  hydrogen  bonds  (UF,D)  were  calculated 
from  the  measured  K\}  values  using  eq  2. 

AAGbind  =  RT  ln[XD(UF-X„)/XD(UF-D)]  (2) 

Fluorescence  Spectroscopy.  To  ascertain  that  all  of  the 
DNA  duplexes  attained  the  same  bound  state,  tryptophan 
fluorescence  measurements  of  free  and  DNA-bound  UDG 
were  performed.  Samples  (497  fiL)  containing  10  mM  Tris- 
HCl  (pH  8.0),  60  mM  NaCl,  and  300  nM  UDG  were 
incubated  for  3  min  at  25  °C  in  a  10  mm  quartz  cuvette, 
and  a  fluorescence  emission  spectrum  was  recorded  in  the 
range  of  325—425  nm  on  a  SPEX  FluoroMax-3  fluorimeter 
(lex  —  295  nm).  Ul  -X„  duplex  DNA  (2.5  mL)  was  then  added 
to  the  UDG  solution  to  give  a  final  DNA  concentration  of 
500  nM.  The  reaction  mixture  was  magnetically  stirred  and 
incubated  for  3  min  at  25  °C,  before  the  fluorescence 
emission  spectrum  of  the  UDG*  DNA  complex  was  recorded. 
The  tryptophan  fluorescence  intensities  of  free  and  DNA- 
bound  UDG  at  333  nm  were  measured,  and  the  raw  values 
were  then  normalized  for  the  fraction  of  UDG  bound  to  each 
DNA  analogue  before  the  ratio  (F’333bound/F’333free)  was  calcu¬ 
lated. 

Differential  Scanning  Calorimetry.  DSC  measurements  of 
duplex  strand  melting  were  taken  using  a  VP-DSC  micro¬ 
calorimeter  from  Microcal,  Inc.  (Northampton,  MA)  es¬ 
sentially  as  described  previously  (33).  The  DNA  solutions 
had  a  concentration  of  20  «M  in  DNA  duplex  with  10  mM 
Na2HP04  (pH  7.5)  and  60  mM  NaCl.  Samples  were 
equilibrated  at  20  °C  for  15  min  and  scanned  up  to  95  °C  at 
a  preset  scan  rate  of  60  °C/h.  The  transition  peak  areas  were 
measured  using  the  EXAM  software  program  (34),  and  the 
transition  peak  areas  were  divided  by  the  DNA  duplex 
concentration  to  provide  the  transition  enthalpies.  Transition 
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Figure  2:  Structures  of  UF-X„  base  pairs:  UF-D  (54,  55),  UF-G 
(56),  UF-N  (57,  58),  UF-<F  (59-61),  and  UF-M  (62). 


entropies  were  determined  from  plots  of  Cp/T  against  T,  by 
dividing  the  transition  peak  area  by  the  DNA  duplex 
concentration.  Procedures  for  buffer  baseline  corrections  and 
accounting  for  pre-  and  post-transition  baselines  have  been 
described  previously  (33). 

KM11O4  Oxidation  Measurements.  Because  of  the  low 
reactivity  of  UF  to  oxidation  (55),  it  was  replaced  with 
thymine  (T)  in  the  oligonucleotides  used  in  this  study.  To 
reaction  mixtures  (20  fiL)  containing  10  mM  Tris-HCl  (pH 
8.0),  60  mM  NaCl,  and  either  100  nM  single-stranded  5'- 
[32P]T  or  5'-[32P]T-X„  duplex  was  added  2.5  mM  KMn04. 
After  the  sample  had  been  incubated  for  3  min  at  room 
temperature,  the  reaction  was  halted  by  the  addition  of  20 
uL  of  a  stop  solution  containing  1 .5  M  sodium  acetate,  1  M 
2-mercaptoethanol,  and  200  /rg/mL  tRNA.  The  samples  were 
processed,  imaged,  and  quantified  as  described  previously 
(52). 

RESULTS 

Binding  of  UDG  to  Destabilized  Damaged  Sites.  A  series 
of  15mer  duplexes  were  constructed  in  which  2'-(i-f\  uoro- 
2'-deoxyuridine  (UF),  a  nonhydrolyzable  uracil  analogue  (5), 
was  placed  opposite  a  series  of  bases  (X)  which  form  zero 
to  three  hydrogen  bonds  with  UF  (Figure  2).  The  affinity  of 
UDG  for  each  UF*X  duplex  was  measured  by  a  competitive 
inhibition  kinetic  assay  in  which  a  3mer  substrate  (AUAp) 
is  separated  from  the  abasic  product  (A'FAp)  using  reverse 
phase  HPLC  (Figure  3A)  (31).  Representative  inhibition  data 
are  shown  in  Figure  3B  for  the  duplex  that  contains  a 
uracil '4-me thy lindole  base  pair  (Ul  -M).  The  Ktl  values  for 


A. 


Figure  3:  Determination  of  the  binding  affinity  of  specific  DNA 
duplexes  using  a  competitive  inhibition  assay.  (A)  UDG  HPLC 
activity  assay.  The  substrate  (AUAp)  and  product  A<J>Ap  are 
indicated.  (B)  Inhibition  of  UDG  by  a  U‘M  duplex  (AT  =  0.24  ± 
0.03  nM). 
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Figure  4:  (A)  Binding  affinities  of  UDG  for  UF-X„  DNA  duplexes. 
The  Kd  values  (nanomolar)  are  shown  above  the  individual  bars. 
(B)  Quench  of  UDG  tryptophan  fluorescence  upon  binding  of  the 
UF-X„  duplex.  The  degree  of  UDG  quenching  was  normalized  to 
reflect  the  quenching  when  UDG  is  saturated  with  DNA  (see 
Experimental  Procedures). 


the  six  duplexes  showed  an  incremental  decrease  as  the 
number  of  hydrogen  bonds  was  ablated  (Figure  4A).  The 
tightest  observed  binding  affinity  was  for  the  UF-M  construct 
that  has  no  hydrogen  bonds  (Ko  —  0.25  nM).  This  affinity 
is  3000-fold  tighter  than  that  of  the  uraciFdiaminopurine 
duplex  that  has  three  hydrogen  bonds  (U'  -D,  K\>  =  820  nM). 
Similar  tight  binding  was  observed  for  the  duplex  with  an 
abasic  site  opposite  UF  (U’-db  Kn  —  2.4  nM),  providing 
additional  evidence  that  complete  ablation  of  hydrogen 
bonding  leads  to  a  significantly  increased  binding  affinity.3 


3  The  observed  Kd  for  the  UF-<1>  duplex  (2.5  nM)  predominantly 
reflects  binding  of  UDG  to  UF,  and  not  to  <lb  because  the  Kd  for  abasic 
DNA  is  much  weaker  (70  nM  for  T-O  and  100  nM  for  <f>-A). 
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Table  1: 

Thermodynamic  Parameters  for  UDG  Binding,  DNA  Melting,  and  Permanganate  Accessibility  of  Destabilized  Base  Pairs 

duplex 

Kd  (nM) 

AGbimi  (kcal/mol) 

A//meit  (kcal/mol) 

ASmeit  (cal  mol  1  K  ’) 

AGmeit"  (kcal/mol) 

log  Sb 

UF-D 

820  ±  90 

-8.3  ±0.1 

58.9  ±  1.8 

0.16  ±0.01 

10.9  ±  2.3 

-1.4  ±0.14 

UF-A 

300  ±  50 

-8.9  ±0.1 

39.6  ±  4.0 

0.11  ±0.01 

7.1  ±5.2 

-1.0  ±0.06 

UF-N 

77  ±  11 

-9.7  ±0.1 

40.5  ±  2.0 

0.10  ±0.01 

9.2  ±  2.5 

NDC 

UF-G 

57  ±6 

-9.9  ±0.1 

34.0  ±  2.4 

0.10  ±0.01 

5.6  ±3.2 

-0.80  ±  0.03 

UF-<E>3 

2.5  ±  0.4 

-11.7  ±0.1 

4.3  ±0.1 

0.01  ±0.01 

0.3  ±  0.3 

-0.22  ±  0.01 

UF-M 

0.24  ±  0.03 

-13.1  ±0.1 

Nl> 

ND" 

ND" 

-0.36  ±  0.01 

“  Calculated  at  298  K.  b  S  is  the  relative  sensitivity  of  a  T-X  base  pair  to  oxidation  by  KMnCL  (see  the  legend  of  Figure  6).  c  Not  determined. 


These  results  indicate  that  removal  of  three  hydrogen  bonds 
can  enhance  specific  recognition  by  up  to  4.8  kcal/mol. 

In  large  part,  these  binding  effects  reflect  the  thermo¬ 
dynamic  properties  of  the  free  damaged  site  because  these 
discrete  base  pair  perturbations  are  not  expected  to  affect 
interactions  between  UDG  and  the  DNA.4  This  conclusion 
is  supported  by  inspection  of  the  crystal  structures  of  UDG 
complexed  with  substrate  analogues,  which  show  that  UDG 
does  not  make  any  interactions  with  the  base  that  opposes 
the  damage  site,  or  with  the  undamaged  strand  (36).  Thus, 
recognition  solely  involves  the  extrahelical  deoxyuridine  and 
not  other  specific  features  of  the  base  pair  or  duplex.  To 
further  establish  that  all  of  the  U'-X  duplexes  used  here  attain 
the  same  bound  state,  and  that  the  observed  effects  on  binding 
largely  arise  from  the  properties  of  the  free  DNA,  we 
measured  the  tryptophan  fluorescence  quenching  upon  bind¬ 
ing  of  each  duplex  (Figure  4B).  Previous  work  has  shown 
that  the  quenching  of  UDG  tryptophan  fluorescence  upon 
specific  DNA  binding  is  a  sensitive  measure  of  an  induced 
fit  conformational  change  in  UDG  that  is  required  to  achieve 
the  final  productive  conformation  with  a  flipped-out  uracil 
(5).  Within  the  errors  of  these  measurements,  all  of  the 
duplexes  produced  the  same  magnitude  of  fluorescence 
quenching,  indicating  that  the  same  bound  conformations 
were  attained  for  all. 

Thermodynamic  Stabilities  of  UF‘X  Duplexes.  We  then 
determined  the  energetic  effects  of  this  series  of  site-specific 
base  pair  disruptions  on  the  thermodynamic  parameters  for 
DNA  melting  using  differential  scanning  calorimetry  (DSC, 
Figure  5).  With  DSC,  one  can  measure  the  enthalpy  (A//meit) 
and  entropy  (ASmeit)  of  the  melting  transition  directly,  and 
unlike  optical  methods,  it  is  insensitive  to  the  mechanism  of 
duplex  melting  (57).  The  free  energy  of  duplex  dissociation 
(AGmeit)  at  any  temperature  can  then  be  simply  calculated 
from  A//meit  and  A5meit,  using  the  relationship  AGmeit  = 
AF/meit  —  7’A.S'ln(.||,  because  the  heat  capacity  of  the  duplex 
and  that  of  the  single  strands  are  equal  (A Cp  ~  0)  (38,  39). 
As  expected,  UF-X  duplexes  exhibited  decreasing  transition 
enthalpies  in  the  A//meit  range  of  58.9—4.3  kcal/mol  as  the 
number  of  hydrogen  bonds  was  decreased.  The  complete 
thermodynamic  parameters  for  duplex  melting  are  reported 
in  Table  1. 

Correlation  of  UDG  Binding  Affinity  with  Damaged  Base 
Pair  Stability.  To  quantitatively  evaluate  the  impact  of 
damaged  base  pair  disruption  on  UDG  affinity,  the  differ- 


4  The  observation  of  linear  free  energy  relationships  between  the 
thermodynamic  parameters  of  the  free  duplex  DNA  and  the  overall 
free  energy  of  DNA  binding  is  not  negated  if  different  energetic 
interactions  exist  between  the  enzyme  and  each  bound  duplex.  However, 
the  observed  slopes  will  reflect  the  relative  effects  of  perturbing  the 
duplex  in  the  free  and  bound  state. 


T  (°C) 

Figure  5:  Differential  scanning  calorimetry  (DSC)  of  the  UF-D 
duplex.  A  20  pM  DNA  duplex  solution  in  10  ttiM  sodium  phosphate 
(pH  7.5)  and  60  mM  NaCl  was  placed  inside  of  the  DSC  sample 
cell,  and  the  change  in  the  heat  capacity  of  the  solution  (AC,,)  was 
monitored  as  it  was  warmed  from  20  to  95  °C  at  a  rate  of  60  °C/h. 
The  thermodynamic  parameters  of  duplex  dissociation  were  ex¬ 
tracted  as  described  in  Experimental  Procedures. 

ences  in  binding  free  energies  (AAGb,„d,  Figure  1)  were 
plotted  against  the  changes  in  transition  enthalpies  (AA//nleit) 
and  entropies  (— TAASmeit)  (Figure  6A,B).  In  this  analysis, 
the  difference  energies  are  relative  to  the  duplex  with  three 
hydrogen  bonds  (UF,D),  and  the  value  of  —  TAASmeit  was 
calculated  at  298  K.  AAGbmd  was  found  to  increase  linearly 
as  the  transition  enthalpy  and  entropy  decrease:  AA//meit 
(slope  =  0.064  ±  0.01,  r2  =  0.937)  and  —  7AASmeit  (slope 
=  0.080  ±  0.01 1,  r2  —  0.949).  As  required  from  these  linear 
correlations,  a  strong  correlation  with  AAGmeit  (slope  =  0.3 
±  0. 1,  r2  —  0.839)  was  also  observed  (data  not  shown).  The 
implications  of  these  correlations  are  discussed  below. 

Correlation  of  UDG  Binding  Affinity  with  Damaged  Base 
Accessibility.  The  above  thermodynamic  correlations  suggest 
that  these  destabilized  duplexes  might  exhibit  an  increase  in 
the  number  of  dynamic  fluctuations  that  promote  extrahelical 
states  of  UF  at  temperatures  well  below  the  duplex  melting 
temperature.  Since  the  dynamic  accessibility  of  the  damaged 
base  is  another  factor  that  could  enhance  its  recognition  by 
DNA  glycosylases,  it  was  of  interest  to  measure  the  relative 
accessibility  of  each  destabilized  base  pair,  and  correlate  this 
parameter  with  UDG  binding  affinity. 

To  explore  this  question,  a  potassium  permanganate 
(KMnOd)  sensitivity  assay  was  employed  (52,  55,  40).  Since 
sites  of  pyrimidine  oxidation  are  susceptible  to  strand 
cleavage  under  basic  conditions,  they  can  be  detected  as 
fragments  using  polyacrylamide  gel  electrophoresis  (Figure 
7A)  (41).  In  these  studies,  UF  was  first  replaced  with  thymine 
(T)  because  of  its  poor  oxidation  reactivity  arising  from  its 
electron  deficient  5,6-double  bond  (data  not  shown).  This  is 
a  very  conservative  change,  as  a  T-X  base  pair  will  have 
base  pairing  strength  and  geometry  nearly  identical  to  those 
of  a  UF*X  base  pair  (42,  43).  As  shown  in  Figure  7A, 
decreasing  the  base  pair  strength  leads  to  an  increase  in  the 
sensitivity  of  the  T-X  base  pair  to  permanganate  oxidation, 
and  also  the  invariant  T-A  base  pair  three  nucleotides  away. 
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Figure  6:  Correlations  between  UDG  binding  affinity  for  each 
UF-X„  duplex  (AAGbimi)  and  the  thermodynamic  parameters  for 
duplex  melting.  The  numbers  of  hydrogen  bonds  in  the  UF-X„  base 
pair  are  indicated.  (A)  AAGbind  vs  AA//meit  (slope  =  0.064,  r2  = 
0.937).  Only  an  upper  limit  for  the  enthalpy  of  melting  for  the  UF- 
M  duplex  was  obtained  (arrow).  (B)  AAGbind  vs  — rAA5meit  (slope 
=  0.080,  r2  =  0.949). 

indicating  that  even  very  conservative  changes  to  the  T-X 
base  pair  can  influence  the  dynamics  of  neighboring  base 
pairs  in  the  duplex  (44).  A  plot  of  log  K0  against  log(relative 
KMnOd  sensitivity)  shows  a  linear  correlation  (Figure  7B), 
establishing  that  increasing  base  accessibility  at  temperatures 
well  below  the  7m  value  has  a  strong  positive  effect  on 
binding  affinity.  As  an  important  control,  the  amount  of 
oxidized  product  was  found  to  increase  linearly  with  respect 
to  time  and  concentration  of  KMn04  (data  not  shown). 
Therefore,  the  differences  in  the  sensitivity  of  these  duplexes 
to  oxidation  directly  reflect  the  unfavorable  dynamic  pre¬ 
equilibrium  for  exposure  of  the  thymidine  base  prior  to 
reaction  with  KMn04. 

DISCUSSION 

Thermodynamic  Framework  for  Active  and  Passive  Base 
Flipping.  While  it  is  clear  from  structural  and  spectroscopic 
studies  that  DNA  glycosylases  bind  their  cognate  damaged 
base  in  an  extrahelical  conformation  (5,  11),  the  pathway 
by  which  the  damaged  base  is  flipped  out  of  the  DNA  duplex 
and  placed  inside  of  the  enzyme  active  site  remains  poorly 
defined.  In  one  model,  DNA  glycosylases  passively  capture 
damaged  bases  that  are  transiently  extrahelical  (Figure  8) 
(14,  45).  According  to  this  view,  the  DNA  glycosylase  does 
not  lower  the  activation  energy  or  equilibrium  for  damaged 
base  flipping,  but  instead  relies  on  the  increased  extrahelical 
propensity  of  damaged  bases  to  enhance  bimolecular  en- 
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Figure  7:  (A)  Sensitivity  of  thymine  in  single-stranded  and  duplex 
DNA  to  oxidation  by  potassium  permanganate.  Samples  of  5'-32P- 
labeled  single-stranded  or  T-X„  duplex  DNAs  were  reacted  with 
2.5  mM  KMnCh  ([O])  for  3  min.  After  the  oxidized  strands  had 
been  cleaved  with  piperidine,  the  reaction  mixtures  were  run  a  19% 
denaturing  polyacrylamide  gel,  and  the  radioactivity  of  each  band 
was  quantified  with  a  phosphorimager.  The  position  on  the  gel  of 
the  full-length  DNA  is  marked  with  a  black  wedge.  (B)  Correlation 
between  UDG  binding  affinity  and  KMnCh  sensitivity  (slope  = 
—2.37  ±  0.270,  r2  =  0.963).  The  relative  sensitivity  of  a  thymine 
in  a  T-X„  base  pair  is  defined  as  [(Z18  —  /bk8d)/(/,otal  _  ^bkgd^dupiex  x 
lOOJ/K/™  -  T*gd)/(710tal  -  /jkgd)Single-stranded  x  J00],  where  FS  is 
the  intensity  of  the  band  corresponding  to  oxidation  of  the  T  in  the 
T-X„  base  pair,  T*811  is  the  background  correction,  and  T01111  is  the 
sum  of  all  of  the  intensities  of  the  bands  in  a  given  lane.  The 
numbers  of  hydrogen  bonds  in  the  T-X„  duplex  are  indicated. 
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Figure  8:  Thermodynamic  model  showing  the  energetic  equiva¬ 
lence  of  the  passive  and  active  base  flipping  mechanisms  (see  the 
text).  The  equivalence  of  the  pathways  is  illustrated  by  the 
thermodynamic  box,  which  requires  that  l/XDaPP  =  A_inbindA'enzflip  = 

/(''Tion  I Y  bind 
A-  flipA-out 

counter.  In  the  alternative  view,  DNA  glycosylases  actively 
flip  out  their  cognate  lesions  by  destabilizing  the  damaged 
site  in  an  initial  encounter  complex.  Active  base  flipping 
may  occur  by  stabilization  of  high-energy  intermediate 
conformations  on  the  base  flipping  pathway,  or  by  the  use 
of  mechanical  forces  to  propel  the  base  from  the  duplex 
(Figure  8)  (5,  46).  It  is  important  to  point  out  that  passive 
and  active  flipping  pathways  cannot  be  distinguished  by 
thermodynamic  measurements  alone.  This  conclusion  is 
required  because  XinbindXenzflip  =  XnonnipXoutbind,  as  shown  in 
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Figure  8.  Only  an  assessment  of  the  kinetic  competence  of 
each  pathway  can  reveal  whether  passive  or  active  base 
flipping  is  the  major  route  taken  for  a  given  system. 

Specific  Recognition  and  Conformational  Freedom  of 
Damaged  Sites.  We  have  found  that  the  affinity  of  UDG  for 
a  specific  site  is  linearly  dependent  on  the  enthalpy  of  duplex 
dissociation  (Figure  6A).  The  linear  relationship  between 
UDG  affinity  and  A//meit  confirms  the  notion  that  weakened 
base  pairing  decreases  the  enthalpy  of  duplex  melting  and, 
consequently,  increases  UDG  binding  affinity  (Figure  6A). 
However,  this  finding  alone  provides  little  insight  into  the 
physical  mechanism  by  which  decreases  in  base  pair  enthalpy 
lead  to  enhanced  binding.  The  small  slope  of  the  correlation 
suggests  that  the  higher  affinity  of  UDG  for  destabilized 
damaged  sites  does  not  simply  arise  from  the  reduced 
enthalpic  cost  of  breaking  a  destabilized  base  pair  during 
base  flipping,  and  that  other  energetic  influences  must  be  at 
work.4  It  should  be  stated  that  the  experiments  presented  here 
are  explicitly  designed  to  probe  the  enthalpic  and  entropic 
contributions  of  the  damaged  site  alone  to  specific  recogni¬ 
tion.  The  total  enthalpy  and  entropy  of  binding  (including 
the  enzyme,  DNA,  and  solvent)  are  not  evaluated  in  any  of 
the  current  experiments,  and  in  fact,  these  measurements  are 
not  required  for  the  conclusions  presented  below. 

The  plot  of  AAGbind  versus  —  7AA5men  reveals  that  there 
is  an  equally  significant  correlation  between  changes  in 
duplex  entropy  and  binding  affinity  (Figure  6B).  The 
parameter  —  7AA5meit  likely  reflects  the  increased  confor¬ 
mational  flexibility  of  the  destabilized  base  pairs  in  the 
duplex  DNA,  because  the  entropy  differences  of  the  dis¬ 
sociated  single-stranded  DNAs  in  the  melting  experiments 
should  be  similar,  given  the  conservative  changes  in  these 
substrates.  If  we  assume  this  physical  interpretation  for  the 
entropy  changes  between  these  DNA  constructs,  the  cor¬ 
relation  suggests  that  increased  flexibility  of  the  base  pair 
produces  conformational  states  that  are  productive  for  UDG 
binding.  The  conclusion  that  extrahelical  conformational 
states  are  produced  is  supported  by  the  correlation  between 
UDG  binding  affinity  and  KMn04  sensitivity  (Figure  7B), 
which  reflects  the  dynamic  equilibrium  of  the  base  between 
an  inaccessible  and  permanganate  accessible  state  (i.e.,  an 
extrahelical  exposed  conformation).  A  reasonable  interpreta¬ 
tion  of  these  combined  findings  is  that  enthalpic  destabiliza¬ 
tion  of  the  base  pair  allows  increased  conformational 
flexibility,  producing  extrahelical  conformers,  some  of  which 
favor  enzyme  binding. 

The  effects  of  base  pair  enthalpy  and  entropy  on  the 
extrahelical  conformational  distributions  that  may  promote 
base  flipping  are  depicted  in  Figure  9A— C.  In  these  panels, 
the  probability  of  an  extrahelical  conformation  is  plotted 
against  the  backbone  pseudodihedral  angle  (xp)  of  the 
deoxyuridine  nucleotide,  defined  as  indicated  in  Figure  9. 
The  angle  xp  has  been  previously  used  in  computational 
studies  to  describe  the  pathway  for  base  flipping  (47),  and 
is  used  here  because  of  its  simple  representation  of  the  base 
flipping  trajectory,  although  none  of  the  arguments  depend 
on  this  formalism.  Using  this  nomenclature,  a  xp  of  10° 
reflects  the  fully  base  paired  state  and  a  xp  of  180°  reflects 
the  fully  extrahelical  state.  In  free  DNA  (Figure  9A),  the  U 
nucleotide  in  the  stable  base  pair  with  three  hydrogen  bonds 
(U'D)  should  be  tightly  centered  around  an  average  confor¬ 
mation  with  a  ip  of  10°  (red  curve),  while  the  unstable 
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Figure  9:  Enthalpic  destabilization  of  the  U-X„  base  pair  leads  to 
extrahelical  conformations  that  promote  binding.  The  probability 
of  an  extrahelical  conformer  as  a  function  of  the  pseudodihedral 
angle  ? p,  which  is  a  measure  of  the  progress  along  the  base  flipping 
reaction  coordinate,  is  indicated  (see  the  text  and  ref  47).  Using 
this  nomenclature,  a  fully  stacked  base  pair  has  a  xp  value  of  ~10°, 
whereas  the  fully  extrahelical  conformation  has  a  xp  value  of  180°. 

(A)  Enthalpic  destabilization  of  the  base  pair  in  the  free  DNA  leads 
to  extrahelical  conformations  that  promote  passive  base  flipping. 

(B)  Base  pair  destabilization  can  also  affect  an  active  mechanism 
in  which  UDG  forms  an  initial  encounter  complex  with  the  DNA 
in  which  the  base  is  not  yet  fully  extrahelical  (5,  46,  48).  In  this 
case,  enzyme  binding  energy  is  used  to  destabilize  the  U‘D  base 
pair,  allowing  it  to  achieve  extrahelical  states  already  available  to 
the  U‘<I>  duplex  due  to  its  intrinsic  instability.  (C)  A  hypothetical 
distribution  of  extrahelical  conformers  for  the  final  UDG'DNA 
complex.  The  enzyme  has  fully  stabilized  the  flipped-out  uracil, 
and  the  distribution  of  conformational  states  for  the  U-D  and  U-<I> 
duplexes  is  narrowly  focused  around  a  xp  of  180°  (36,  47). 

construct  with  no  hydrogen  bonds  (U-<I>)  should  be  broadly 
centered  around  a  xp  value  further  along  the  flipping  reaction 
coordinate  (green  curve).  Thus,  enthalpic  destabilization  of 
the  base  pair  in  a  passive  mechanism  leads  to  conformations 
that  facilitate  binding  of  the  enzyme  (see  Figure  8,  counter¬ 
clockwise  pathway). 

For  the  alternative  active  base  flipping  mechanism  (Figure 
9B),  binding  of  UDG  in  an  initial  encounter  complex  can 
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alter  the  DNA  structure  such  that  the  more  stable  U  ■  13  base 
pair  is  also  destabilized  (Figure  9B,  red  curve)  (5,  46,  48). 
In  this  initial  destabilized  complex,  the  partially  extrahelical 
uracil  may  assume  a  similar  average  conformation,  and  broad 
distribution,  for  both  the  U'D  and  U'O  constructs.  However, 
flipping  of  U  in  the  U'D  construct  requires  a  greater  amount 
of  binding  energy  to  overcome  the  enthalpic  barrier  to  base 
flipping,  resulting  in  weaker  binding  of  the  U'D  duplex 
compared  to  that  of  the  U'<t>  duplex.  Finally,  in  the  Michaelis 
complex,  in  which  the  base  is  fully  extrahelical  (Figure  9C), 
both  the  U'D  and  U'<F  duplexes  assume  the  same  average 
conformation  and  small  conformational  distribution  which 
are  enforced  by  the  strong  interactions  between  the  enzyme 
and  uracil  (36,  46).  Although  this  mechanism  implies  a 
reduction  in  the  conformational  flexibility  of  destabilized 
base  pairs  such  as  the  U  ■  <F  duplex  upon  formation  of  the 
Michaelis  complex,  this  expected  unfavorable  entropic 
contribution  to  the  overall  free  energy  of  binding  may  be 
paid  for  by  the  even  larger  enthalpic  benefit  of  base  pair 
destabilization  for  such  conformationally  flexible  substrates 
(Table  1). 

CONCLUSIONS 

We  have  shown  that  the  maximum  gain  in  specific 
recognition  by  UDG  arising  from  destabilization  of  the 
damaged  base  pair  is  4.8  kcal/mol,  and  that  this  energetic 
effect  likely  arises  from  increasing  the  population  of  extra¬ 
helical  states  that  promote  binding.  Since  the  catalytic 
specificity  of  UDG  for  uracil  as  opposed  to  other  normal 
bases  has  been  estimated  to  be  at  least  8.3  kcal/mol  in  vitro 
(5),  the  energetic  contribution  of  destabilized  damaged  sites 
to  specific  ground  state  binding  can  be  significant,  at  least 
in  this  model  system.  However,  the  in  vivo  substrates  of 
UDG  consist  of  U'A  or  U'G  base  pairs,  which  are  not 
significantly  destabilized  compared  to  other  normal  base 
pairs.  Thus,  spontaneous  base  flipping  at  damaged  sites  is 
not  a  viable  mechanism  for  accounting  for  the  specificity  of 
UDG  in  vivo.  The  remaining  specificity  of  UDG  must  be 
attributed  to  strong  transition  state  interactions  that  can  be 
induced  only  by  actively  flipping  the  uracil  base  into  the 
active  site  (5,  49).  Although  unimportant  for  UDG,  damaged 
site  instability  could  contribute  significantly  to  specific 
recognition  by  repair  enzymes  that  act  on  intrinsically 
unstable  base  pairs  such  as  f/’-methylguanine  (45),  hypox- 
anthine  (50,  51),  A'-methyladenine,  and  W-methylcytosine 
(52,  53). 
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abstract:  Base  flipping  is  a  highly  conserved  strategy  used  by  enzymes  to  gain  catalytic  access  to  DNA 
bases  that  would  otherwise  be  sequestered  in  the  duplex  structure.  A  classic  example  is  the  DNA  repair 
enzyme  uracil  DNA  glycosylase  (UDG)  which  recognizes  and  excises  unwanted  uracil  bases  from  DNA 
using  a  flipping  mechanism.  Previous  work  has  suggested  that  enzymatic  base  flipping  begins  with  dynamic 
breathing  motions  of  the  enzyme-bound  DNA  substrate,  and  then,  only  very  late  during  the  reaction 
trajectory  do  strong  specific  interactions  with  the  extrahelical  uracil  occur.  Here  we  report  that  UDG 
kinetically  and  thermodynamically  prefers  substrate  sites  where  the  uracil  is  paired  with  an  unnatural 
adenine  analogue  that  lacks  any  Watson— Crick  hydrogen-bonding  groups.  The  magnitude  of  the  preference 
is  a  striking  43000-fold  as  compared  to  an  adenine  analogue  that  forms  three  H-bonds.  Transient  kinetic 
and  fluorescence  measurements  suggest  that  preferential  recognition  of  uracil  in  the  context  of  a  series  of 
incrementally  destabilized  base  pairs  arises  from  two  distinct  effects:  weak  or  absent  hydrogen  bonding, 
which  thermodynamically  assists  extrusion,  and,  most  importantly,  increased  flexibility  of  the  site  which 
facilitates  DNA  bending  during  base  flipping.  A  coupled,  stepwise  reaction  coordinate  is  implicated  in 
which  DNA  bending  precedes  base  pair  rupture  and  flipping. 


Enzymes  that  modify  or  cleave  nucleobases  in  DNA,  such 
as  DNA  methyltransferases  and  DNA  glycosylases,  are 
confronted  with  a  formidable  chemical  problem:  gaining 
access  to  substrate  bases  that  are  sequestered  inside  the  DNA 
double  helix  (7—3).  A  conserved  enzymatic  solution  to  this 
problem  is  base  flipping,  where  the  target  base  and  sugar 
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are  extruded  from  the  DNA  duplex  into  the  enzyme  active 
site  (4). 

Stmctural  and  mechanistic  studies  indicate  that  base 
flipping  is  a  multistep  process  involving  two  coupled  reaction 
coordinates  (Figure  1A)  (5—72),  which  are  depicted  using  a 
free  energy  contour  plot  in  Figure  IB.  The  first  coordinate 
involves  ~180°  rotation  of  the  entire  target  nucleotide  from 
the  duplex  stack  (vertical  axis.  Figure  IB),  while  the  second 
coordinate  involves  enzyme-induced  DNA  bending  (hori¬ 
zontal  axis.  Figure  IB).  The  progress  along  each  reaction 
coordinate  may  be  perfectly  synchronized  (diagonal  dashed 
line.  Figure  IB),  or  alternatively,  one  process  may  proceed 
ahead  of  the  other  (curved  trajectory.  Figure  IB).  For 
instance,  if  DNA  bending  proceeds  ahead  of  nucleotide 
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Figure  1:  Uracil  flipping  has  two  coupled  reaction  coordinates  involving  base  rotation  and  DNA  bending.  (A)  Upon  binding  to  free  B 
DNA  (left)  UDG  rotates  the  deoxyuridine  nucleotide  front  the  DNA  base  stack  by  180°  and  bends  the  DNA  by  about  40°.  The  DNA 
structure  on  the  right  was  extracted  from  the  complex  of  UDG  with  substrate  analogue  DNA  (PDB  code  1EMF1)  (8).  (B)  A  two-dimensional 
free  energy  contour  map  depicting  the  two  coupled  reaction  coordinates  of  base  rotation  (vertical  coordinate)  and  DNA  bending  (horizontal 
coordinate).  The  diagram  shows  the  two  enzyme-bound  intermediates  (Ii,  I2)  that  have  been  previously  detected  using  rapid  kinetic  methods 
(5,  6)  and  the  final  product  of  the  flipping  and  bending  reaction  (P).  In  principle,  progress  along  both  reaction  coordinates  may  be  synchronized 
(diagonal  line),  or  one  process  may  lag  behind  the  other.  The  contour  map  depicts  the  reaction  trajectory  (curved  dashed  line)  where 
bending  precedes  flipping,  and  a  low  energy  (blue)  bent  intermediate  is  formed  before  significant  progress  along  the  base  rotation  coordinate 
occurs  (I2).  Thus,  flipping  uracil  from  unbent  DNA  is  a  high-energy  (red)  improbable  process  (upper  left  corner).  Formation  of  I2  can  be 
followed  using  the  increased  fluorescence  of  the  2-aminopurine  probe  which  is  adjacent  to  the  uracil  and  is  very  sensitive  to  base  stacking 
(see  Figure  2)  (5,  6).  Formation  of  P  is  discretely  monitored  by  following  the  decrease  in  tryptophan  fluorescence  of  UDG  that  accompanies 
base  flipping  (5,  6). 


rotation,  a  bent  intermediate  may  form  (lower  right  corner. 
Figure  IB)  before  the  hydrogen  bonds  to  the  target  base  are 
broken  and  the  base  is  expelled  into  the  enzyme  active  site 
(upper  right  comer.  Figure  IB).  A  coupled  reaction  coordi¬ 
nate  for  base  flipping  suggests  that  if  discrete  alterations  in 
the  structural  or  dynamic  features  of  the  DNA  substrate  are 
made,  then  progress  along  one  or  both  coordinates  could  be 
perturbed  in  a  systematic  way.  This  would  allow  a  linear 
free  energy  perturbation  (LFEP)1  analysis  analogous  to  that 
employed  in  simple  chemical  reactions  involving  coupled 
processes  such  as  bond  formation  to  a  nucleophile  and  bond 
breakage  to  a  leaving  group  (13). 

A  LFEP  approach  that  we  are  continuing  to  explore  for 
the  DNA  repair  enzyme  uracil  DNA  glycosylase  (UDG)  is 
to  use  DNA  substrates  in  which  the  number  of  hydrogen 
bonds  between  the  target  uracil  (U)  and  its  opposing  base 
(X)  are  systematically  varied  (Figure  2)  (14).  Our  original 
hypothesis  was  that  removal  of  base  pair  hydrogen  bonds 
would  thermodynamically  facilitate  uracil  flipping  and 
enzyme  binding  by  destabilizing  the  uracil  in  the  DNA  base 
stack  (i.e.,  by  raising  the  free  energy  of  the  lower  left  corner 
of  the  reaction  coordinate  diagram  in  Figure  IB).  A  LFEP 
analysis  has  confirmed  this  initial  expectation,  where  a  strong 
linear  correlation  was  found  between  the  free  energy  of  UDG 
binding  to  a  series  of  DNA  duplexes  with  increasingly 
destabilized  U-X  base  pairs  (m  —  —0.37,  R2  =  0.86)  (14). 
A  key  question  arising  from  this  thermodynamic  study  is 
the  mechanistic  origin  of  the  dramatically  increased  binding 
affinity  to  destabilized  damaged  sites. 

Here  we  use  a  similar  LFEP  approach  to  correlate  the 
effects  of  stepwise  ablation  of  hydrogen  bonds  in  the  U'X 


1  Abbreviations:  LFEP,  linear  free  energy  perturbation;  2-AP, 
2-aminopurine;  FRET,  fluorescence  resonance  energy  transfer;  M, 
4-methylindole  nucleotide;  D,  diaminopurine  nucleotide;  Y,  pyrene 
nucleotide;  N,  purine  nucleotide;  <F,  tetrahydrofuran  abasic  nucleotide; 
UF,  2'-^3-fluoro-2,-deoxyuridine  nucleotide;  UDG,  uracil  DNA  glyco¬ 

sylase. 


5  '  -GCG  GCC  AAZ  UFAA  AAA  GCG  C 
3 ' -CGC  CGG  TTT  X  TT  TTT  CGC  G 
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Z  =  P,  A 
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Figure  2:  Sequences  of  19-rner  DNA  substrates  used  in  the  kinetic 
studies  and  hydrogen-bonding  structures  of  the  UF‘X  base  pairs 
based  on  the  literature:  Uf-d“(57,  58),  UF-G  (59),  UF-N  (47,  48), 
and  UF-M  (29).  Abbreviations:  M,  4-methylindole  nucleotide;  D, 
diaminopurine  nucleotide;  N,  purine  nucleotide;  UF,  2/-/)-fluoro- 
2'-deoxy  uridine  nucleotide. 


base  pair  with  the  rate  of  base  extrusion  (nucleotide  rotation 
reaction  coordinate)  and  the  rate  of  conformational  changes 
in  the  DNA  (bending  reaction  coordinate).  To  monitor  the 
progress  along  these  coordinates,  we  employ  two  indirect 
signal  changes.  The  first  follows  the  increase  in  fluorescence 
of  a  2-aminopurine  (2-AP)  nucleotide  adjacent  to  the  target 
uracil  (5).  This  signal  is  sensitive  to  the  extent  of  stacking 
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of  2-AP  with  the  uracil  (75,  16)  and  thus  reports  on 
conformational  changes  in  the  DNA  as  well  as  progress  along 
the  base  rotation  coordinate  (6,  17—19).  The  second  signal 
involves  a  decrease  in  UDG  tryptophan  fluorescence  that 
accompanies  a  conformational  change  in  UDG  as  it  closes 
around  the  fully  extrahelical  uracil.  This  signal  change  has 
been  shown  by  structural  (8),  kinetic  (5,  6,  77,  20),  and 
mutagenesis  studies  (6,  77)  to  correlate  exclusively  with 
formation  of  the  final  hydrogen-bonding  and  stacking 
interactions  of  the  uracil  base  within  the  enzyme  active  site 
(i.e.,  formation  of  the  final  extrahelical  state  depicted  in  the 
upper  right  corner  of  Figure  IB).  These  studies  suggest  a 
base  flipping  pathway  in  which  DNA  bending  precedes  base 
rotation,  thereby  opening  an  unhindered  passage  by  which 
uracil  may  exit  the  duplex.  Surprisingly,  binding  studies  of 
both  rigid  and  flexible  DNA  duplexes  indicate  that  a  large 
amount  of  enzyme  binding  energy  is  consumed  during  the 
process  of  DNA  bending.  This  finding  appears  to  be  a  general 
feature  of  base  flipping  enzymes  and  is  consistent  with 
previous  suggestions  based  on  structural  and  biochemical 
observations  with  UDG  as  well  as  other  base  flipping 
enzymes  (2,  21—27).  Accordingly,  strong  binding  of  UDG 
to  flexible  target  sites  results  not  only  from  weakened 
hydrogen  bonding  of  the  uracil  but  also  because  flexible  sites 
require  less  enzyme  binding  energy  to  bend.  These  unique 
features  of  target  site  recognition  by  UDG  may  be  shared 
by  other  base  flipping  enzymes  {28). 

EXPERIMENTAL  PROCEDURES 

Materials.  The  2'-deoxynucleoside  phosphoramidites,  CPG 
supports,  and  DNA  synthesis  reagents  were  purchased  from 
Glen  Research  (Sterling,  VA),  except  for  2'-/i-fluoro-2'- 
deoxyuridine  (UF),  4-methylindole  nucleoside  (M),  and 
pyrene  nucleoside  (Y),  which  were  synthesized  as  described 
previously  (5, 29,  30).  The  oligonucleotides  were  synthesized 
using  standard  phosphoramidite  chemistry  on  an  Applied 
Biosystems  392  synthesizer.  The  oligonucleotides  were 
purified  by  anion-exchange  HPLC  (Zorbax  Oligo),  followed 
by  C-18  reversed-phase  HPLC  (Phenomenex  Aqua  column). 
Fractions  containing  pure  oligonucleotide  were  concentrated 
to  dryness  in  vacuo,  redissolved  in  MilliQ  water,  and  stored 
at  —20  °C.  The  purity  of  the  oligonucleotides  was  assessed 
by  matrix-assisted  laser  desorption  mass  spectroscopy  and 
denaturing  polyacrylamide  gel  electrophoresis.  The  concen¬ 
tration  of  each  oligonucleotide  was  determined  using  its 
extinction  coefficient  at  260  nm  (37).  DNA  duplexes  were 
hybridized  in  10  mM  Tris-HCl  (pH  8.0)  and  60  mM  NaCl 
as  described  previously  (5).  The  purification  of  Escherichia 
coli  UDG  has  been  described  previously  (32). 

Kp  Measurements.  The  Kn  values  for  binding  of  the 
UF*X  duplexes  to  UDG  were  measured  using  a  kinetic 
competitive  inhibition  fluorescence  assay  under  conditions 
where  the  apparent  K\  is  equal  to  the  Kn  value  (i.e.,  [S] 

Km,  where  S  exhibits  rapid  equilibrium  binding).  Reaction 
mixtures  (148.5  /<L)  containing  10  mM  Tris-HCl  (pH  8.0), 
60  mM  NaCl,  12.5  /tg/mL  BSA,  2  /<M  AUPA  substrate,  and 
a  variable  amount  of  the  UF,X  duplex  were  placed  in  a  0.3 
cm  quartz  cuvette,  and  its  fluorescence  emission  at  370  nm 
was  monitored  every  10  s  at  25  °C  on  a  SPEX  FluoroMax-3 
fluorometer  (2ex  —  320  nm)  until  the  signal  stabilized.  UDG 
(1.5  fih)  was  then  added  to  the  reaction  mixture  to  give  a 
final  concentration  of  0.25  nM.  The  progress  of  the  reaction 


was  then  monitored  as  described  above  for  5  min.  The  KD 
for  each  duplex  was  determined  by  fitting  to  the  equation 

k/k0=l/(l  +  [UF-X]/£D)  (1) 

where  k\  is  the  inhibited  rate  and  kn  is  the  rate  in  the  absence 
of  competitor  DNA. 

Stopped-Flow  Fluorescence.  The  observed  rate  constants 
for  the  formation  of  UDG -DNA  complexes  were  measured 
on  an  Applied  Photophysics  720  stopped-flow  fluorometer 
(Surrey,  U.K.).  All  measurements  were  performed  under 
pseudo-first-order  conditions  where  the  concentration  of  the 
unlabeled  component  was  at  least  4-fold  greater  the  con¬ 
centration  of  the  labeled  species.  All  measurements  were 
made  using  a  buffer  containing  10  mM  Tris-HCl  (pH  8.0), 
60  mM  NaCl,  and  1  mM  DTT.  For  experiments  where 
changes  in  2-AP  fluorescence  were  observed,  excitation  was 
at  315  nm,  and  a  360  nm  long -pass  emission  filter  was  used. 
In  experiments  where  tryptophan  fluorescence  was  measured, 
an  excitation  wavelength  of  290  nm  was  used  with  a  335 
nm  long-pass  emission  filter.  All  kinetic  traces  were  well 
fitted  to  a  first-order  rate  expression  to  obtain  the  observed 
rate  constant  (k0bSd,  eq  2). 

F,  =  AFexp(l-kobsdr)  +  F0  (2) 

For  the  experiments  in  which  the  rate  of  2-AP  signal 
increase  of  the  DNA  was  monitored,  a  solution  containing  a 
variable  amount  of  UDG  was  rapidly  mixed  with  a  fixed 
concentration  of  2-AP-labeled  DNA.  Final  concentrations  of 
enzyme  and  DNA  after  mixing  were  in  the  range  0.4— 6.4 
/<M  and  100  nM,  respectively.  For  the  experiments  in  which 
the  rate  of  tryptophan  signal  decrease  of  the  enzyme  was 
monitored,  a  solution  containing  a  variable  amount  of  DNA 
was  rapidly  mixed  with  a  fixed  concentration  of  the  enzyme. 
In  these  experiments,  the  final  concentrations  of  DNA  and 
enzyme  after  mixing  were  in  the  range  0.4—12.8  «M  and 
100  nM,  respectively.  Measurements  with  the  UF,M  duplex 
using  2-AP  fluorescence  could  not  be  accurately  made  above 
1 .6  /<M  because  only  20%  of  the  observable  signal  remained 
after  the  ~1  ms  dead  time  of  the  instrument  had  elapsed. 
As  previously  observed,  plots  of  kobsd  against  [UDG]  or 
[UF-X]  were  hyperbolic,  indicating  a  multistep  binding 
mechanism.  The  kinetic  parameters  were  extracted  by  fitting 
the  data  to  a  minimal  two-step  binding  model  (eqs  3  and  4), 

k  k 

E  +  S  ES  E*F  (3) 

k-i  k-2 

,  *off[S]+*off 


where  K'  —  k\l{k-\  +  kmax)  is  the  apparent  affinity  constant, 
kon  —  K'ikn  +  k-2)  is  the  apparent  second-order  association 
rate  constant,  and  ka ff  =  k-ik-2/(k-i  +  k2  +  k-2)  is  the  overall 
dissociation  rate  constant.  The  maximum  rate  constant  for 
the  unimolecular  rearrangements  detected  by  the  2-AP  and 
tryptophan  fluorescence  signals  is  measured  by  the  asymptote 
kmax2AP  (or  kmaxTrp)  =  k2  +  k-2  (5).  Although  base  flipping 
has  been  previously  shown  to  involve  two  internal  steps 
rather  than  the  one  shown  in  eq  3  (6,  7  7,  20),  the  simplified 
analytical  expression  of  eq  4  is  very  useful  for  comparing 
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the  kinetic  behavior  of  a  series  of  substrates  or  mutated 
enzymes  (6). 

Very  slow  dissociation  rate  constants  determined  from  the 
concentration  dependence  of  k0bSd  (eq  3)  have  a  high  degree 
of  uncertainty  because  ka s  is  derived  from  the  v-interccpt  of 
a  plot  of  kobsd  against  [S].  Under  such  conditions,  a  trapping 
experiment  is  required  to  obtain  a  more  precise  value  for 
kaf{.  Trapping  experiments  for  the  substrates  containing  D, 
A,  G,  and  N  in  the  base  pair  with  uracil  were  performed  by 
rapidly  mixing  a  solution  consisting  of  UDG  (200—500  nM) 
and  2-AP-labeled  DNA  (200  nM)  with  a  large  molar  excess 
of  nonfluorescent  trapping  duplex  (20  fiM  AUFA-11)  (6) 
using  a  stopped-flow  fluorometer.  The  time-dependent 
decrease  in  2-AP  fluorescence  as  the  bound  DNA  irreversibly 
dissociated  was  fit  to  a  single-exponential  decay  to  obtain 
k0ff  (eq  5).  Because  of  the  very  slow  dissociation  of  UF,M, 


U7M- 


H-bonds 


lf/N 


U/G 


2  (Wobble) 


100  200  300  400  500 

Kd  (nM) 


Ft  =  AFexp(-koff)  +  F0  (5) 

its  k0ff  was  measured  by  manually  mixing  a  solution 
consisting  of  UDG  (100  nM)  and  PUF,M  (100  nM)  with  1 
/tM  high-affinity  trap,  AUFA/TMT-15  ( 14).  This  reaction 
was  performed  using  a  0.3  cm  path  length  quartz  cuvette 
(150  /<L)  and  a  Spex  FluoroMax-3  fluorometer.  The  time- 
dependent  decrease  in  2-AP  emission  at  370  nm  was 
followed  with  excitation  at  315  nm. 


Change  in  Number  of  H-bonds 


RESULTS 

Binding  of  UDG  to  Destabilized  Damaged  Sites.  We 
synthesized  a  series  of  19-mer  DNA  duplexes  containing  a 
series  of  UF-X  base  pairs  using  a  sequence  based  on  a  duplex 
previously  used  in  rapid  kinetic  studies  of  base  flipping  by 
UDG  (Figure  2)  (5).  The  2'  fluorinated  deoxyuridine 
substrate  analogue  (UF)  is  an  extremely  slow  substrate  for 
UDG  (0/2  ~  1  day),  allowing  measurements  of  DNA  binding 
and  base  flipping  without  the  complication  of  glycosidic  bond 
cleavage  (5).  The  affinity  of  UDG  for  these  duplexes  was 
measured  using  a  competitive  inhibition  kinetic  assay  (14). 
As  previously  observed  for  a  series  of  15-mer  duplexes  with 
a  different  sequence  (14),  the  Kn  values  for  the  19-mers 
decreased  incrementally  as  the  LT|  -X  base  pair  weakened 
(Figure  3 A  and  Table  1).  The  UF,M  duplex,  which  contains 
no  hydrogen  bonds,  binds  a  striking  ~43400-fold  (6.3 
kcal/mol)  more  strongly  than  the  duplex  containing  the 
UF*D  base  pair  with  three  hydrogen  bonds.  Of  note,  the 
incremental  decrease  in  the  free  energy  of  binding  was  only 
about  1  kcal/mol  when  the  first  and  second  hydrogen  bonds 
were  removed  from  the  UF,D  base  pair  (Figure  3B),  which 
is  similar  to  estimates  of  the  free  energy  contribution  of 
individual  hydrogen  bonds  to  the  stability  of  duplex  nucleic 
acids  (55).  In  contrast,  removal  of  the  last  hydrogen  bond  to 
form  the  UF,M  pair  resulted  in  a  much  larger  4  kcal/mol 
decrease  in  binding  free  energy,  indicating  the  presence  of 
additional  energetic  contributions.  These  additional  contribu¬ 
tions  may  involve  a  loss  of  both  hydrogen  bond  and  base 
stacking  interactions,  leading  to  a  large  increase  in  confor¬ 
mational  entropy  of  the  UF,M  base  pair  (14).  This  result 
suggests  that,  with  respect  to  the  free  energy  of  binding, 
increased  flexibility  of  the  site  may  play  a  more  important 
role  than  the  enthalpic  benefit  of  removing  single  hydrogen 
bonds  (see  below). 


Figure  3 

Figure  3:  Binding  affinities  of  UDG  for  UF-X  DNA  duplexes  and 
incremental  change  in  binding  free  energy  when  each  hydrogen 
bond  in  removed.  (A)  Trend  in  dissociation  binding  constants  as 
base  pair  hydrogen  bonds  are  removed  (see  also  Table  1).  The 
number  of  hydrogen  bonds  in  the  UF-X  base  pair  is  shown  to  the 
right  of  the  bars.  (B)  Incremental  change  in  binding  free  energy  as 
each  hydrogen  bond  is  removed  from  the  UF-X  base  pair  (AAGbind 
=  —RT  In  Kd"IKd"-\  where  n  is  the  number  of  hydrogen  bonds  in 
the  base  pair). 

Binding  Kinetics.  To  dissect  the  origins  of  the  dramatically 
enhanced  binding  affinity  of  the  duplexes  with  destabilized 
U'  -X  base  pairs,  the  kinetics  of  association  and  dissociation 
were  measured.  Association  rates  were  measured  using 
stopped-flow  fluorescence  measurements  by  monitoring 
either  the  increase  in  2-AP  fluorescence  that  accompanies 
uracil  unstacking  and  DNA  bending  (Figure  4A)  or  the 
decrease  in  tryptophan  fluorescence  that  marks  the  attainment 
of  the  final  extrahelical  state  (Figure  4B)  (5,  6,  17).  As 
observed  in  previous  studies  using  duplexes  with  U1- A  and 
U'  -G  base  pairs  (5),  plots  of  k0bsd  versus  concentration  were 
hyperbolic  when  either  the  2-AP  or  tryptophan  fluorescence 
signals  were  monitored  (panels  A  and  B  of  Figure  5, 
respectively).  This  kinetic  behavior  indicates  a  change  in  rate- 
limiting  step  from  bimolecular  encounter  at  low  concentra¬ 
tions  of  the  varied  reactant  to  a  unimolecular  conformational 
change  of  the  DNA  and  enzyme  at  high  reactant  concentra¬ 
tions.  All  hyperbolic  plots  were  fit  to  the  two-step  binding 
model  (eq  4),  and  the  kinetic  constants  obtained  from  this 
analysis  are  reported  in  Table  1. 

There  are  revealing  aspects  of  the  kinetics  for  the  2-AP 
and  tryptophan  fluorescence  changes.  First,  for  the  substrate 
with  the  most  stable  base  pair  (UF,D),  the  association  kinetics 
measured  using  the  2-AP  signal  (kon2AP)  is  3-fold  faster  than 
when  the  tryptophan  signal  is  followed  (konTrp)  (Table  1). 
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Table  1:  Binding  Affinities  and  Kinetic  Parameters  for  UDG  Association  and  Dissociation  with  UF-X  Duplexes 


kon2AP  £onTrp 


duplex 

KD(nM) 

1C 2Ae  (jM.-') 

K’  Tip  (/dVR1) 

(/<M  us  *) 

(mM  us  ') 

k0ff  (s  ') 

XDcalc  (nM)d 

UF-D 

434  ±  100“ 

0.29  ±  0.05 

0.12  ±0.02 

206  ±  22 

72  ±7 

44  ±  1.3 

214  ±24 

UF-A 

80  ±  20a  b 

0.08  ±0.01 

0.13  ±0.01 

236  ±  10 

172  ±8 

26  ±  0.3 

110  ±4 

UF-G 

50  ±  20ab 

0.10  ±0.01 

0.25  ±  0.08 

292  ±  13 

331  ±56 

6.9  ±  0.04 

24  ±  1 

UF-N 

8.2  ±  0.3“ 

0.20  ±  0.04 

0.14  ±0.01 

282  ±  28 

315  ±5 

6.0  ±  0.05 

21  ±2“ 

UF-M 

0.01  ±  0.002“ 

0.32  ±0.15 

0.64  ±  0.08 

292  ±  40 

666  ±  45 

0.018  ±0.001 

0.062  ±  0.009“ 

°  Kb  was  determined  by  directly  monitoring  binding  of  DNA  to  UDG  using  2-AP  fluorescence  (see  Experimental  Procedures).  b  Values  previously 
reported  (5).  “  Kb  was  determined  using  competitive  inhibition  assay  with  AUF-X  substrate  (see  Experimental  Procedures).  The  affinity  of  AUF-A 
is  2-fold  greater  than  that  of  PUF-A  (17).  J  Kb  calculated  from  fc0tf/feon2AP. 


A 


Figure  4:  Stopped-flow  fluorescence  kinetic  measurements  of 
UDG  association  with  UF-X  duplex  DNA.  (A)  Approach  to 
equilibrium  association  rate  of  PUF-N  (0. 1  /<M)  and  UDG  (0.4  /tM) 
monitored  by  the  increase  in  2-AP  fluorescence.  (B)  Approach  to 
equilibrium  association  rate  of  UDG  (0.1  )tM)  and  AUF-N  (0.4  ^/M) 
monitored  by  the  decrease  in  tryptophan  fluorescence  of  UDG. 

However,  as  hydrogen  bonds  are  removed  from  the  base  pair 
kon2AP  and  konTrp  become  indistinguishable.  This  result  arises 
because  kon2AP  is  nearly  invariant  across  the  whole  series  of 
duplexes,  while  konTip  increases  by  about  9-fold  (Figure  5C).2 
The  observation  that  /cnn  llp  is  slower  than  kon2AP  when  the 
base  pair  is  strong  requires  that  the  conformational  change 
in  the  enzyme  lags  behind  the  structural  perturbation  in  the 
DNA  that  results  in  the  2-AP  fluorescence  increase.  The 
strong  dependence  of  konTrp  on  base  pair  hydrogen  bonding 
indicates  that  these  hydrogen  bonds  are  broken  very  late 


2  The  asymptotic  ka bsd  values  for  the  2-AP  signal  (kmax2AP  =  fa2AP  + 
k- 22AP;  see  eq  3)  increase  by  about  2-fold  between  UF-D  and  the  UF-A 
or  UF-G  substrates  but  then  decrease  from  these  peak  values  by  about 
2-fold  for  the  UF-N  and  UF-M  substrates.  This  behavior  may  be 
qualitatively  understood  using  the  simplified  two-step  kinetic  analysis 
we  have  employed.  That  is,  k22AF  and  k- 22AF  are  complex  rate  constants 
that  reflect  the  interconversion  of  Ii,  I2,  and  P  on  the  enzyme:  kmm2AF 
first  increases  as  the  conversion  of  I2  —  P  becomes  more  rapid  due  to 
hydrogen  bond  ablation  and  then  decreases  because  the  reverse  rate 
(k- 22AP)  becomes  negligible  for  the  UF-N  and  UF-M  substrates  due  to 
stabilization  of  I2  and  P  (i.e.,  km ax2AP  ~  &22AP;  see  Discussion). 
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Figure  5:  Concentration  dependence  of  the  apparent  association 
rate  constants  (ka bs)  for  the  five  different  UF-X  base  pairs  and 
relative  km  values  as  compared  to  the  UF-D  substrate.  (A) 
Dependence  of  ka bS  on  the  concentration  of  UDG  followed  by 
changes  in  2-AP  fluorescence.  Data  were  fit  to  the  two-step  kinetic 
model  (eqs  3  and  4).  (B)  Dependence  of  ka bs  on  the  concentration 
of  DNA  followed  by  changes  in  Tip  fluorescence.  Data  were  fit  to 
the  two-step  kinetic  model  (eqs  3  and  4).  (C)  Effect  of  changing 
UF-X  base  pair  strength  on  kon,  expressed  as  the  ratio  of  kon(UF-X) 
to  kon(UF'D).  Ratios  determined  from  the  kinetics  of  the  2-AP  and 
tryptophan  fluorescence  changes  are  displayed  separately. 


along  the  reaction  coordinate,  after  the  step  that  is  reported 
on  by  the  2-AP  signal  and  during  the  final  conformational 
step  when  the  enzyme  closes  around  the  uracil  base.  This 
result  implies  that  the  6-fold  2-AP  signal  increase  reflects 
the  formation  of  an  intermediate  (I2,  Figure  1)  in  which  the 
uracil  stacking  interactions  with  the  adjacent  2-AP  base  are 
perturbed,  but  the  uracil  retains  hydrogen  bonding  with  its 
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Figure  6:  Determination  of  kas  using  a  trapping  experiment.  A 
complex  of  PUF-N  (0.20  /<M)  and  UDG  (0.25  ,mM)  was  rapidly 
mixed  with  a  large  excess  of  nonfluorescent  AUFA-11  single- 
stranded  DNA  (20  ^/M)  (6).  The  koff  is  determined  from  fitting  the 
time-dependent  decrease  in  the  2-AP  fluorescence  of  PUF-N  as  it 
is  irreversibly  released  from  UDG  (eq  5). 

partner  base  (6,  1 7).  Since  the  preceding  intermediate  (Ii, 
Figure  1)  is  not  detectably  bent  (7),  and  the  subsequent 
extrahelical  state  is  bent  by  about  40°  (8),  then  the  structural 
implication  of  the  increase  in  2-AP  fluorescence  is  that  the 
DNA  is  bent  in  the  transition  state  leading  to  I2,3  although 
the  magnitude  of  the  bending  at  this  point  along  the  reaction 
coordinate  is  not  known.  It  is  important  to  point  out  that  the 
9-fold  increase  in  kon  pales  in  comparison  with  the  ap¬ 
proximately  43000-fold  increase  in  UDG  binding  affinity 
when  these  three  hydrogen  bonds  are  ablated.  Thus,  the  vast 
majority  of  the  binding  affinity  increase  is  not  attributable 

tO  /ton- 

The  above  results  strongly  suggest  that  the  large  thermo¬ 
dynamic  benefit  of  removing  hydrogen  bonds  in  the  UF-X 
base  pair  arises  from  profound  differences  in  the  dissociation 
rates  of  these  duplexes.  To  establish  this  point,  the  dissocia¬ 
tion  rates  (k0 ff)  were  measured  using  an  irreversible  trapping 
experiment  (5)  in  which  a  solution  of  UDG*(PUF,X)  was 
rapidly  mixed  with  a  large  excess  of  unlabeled  trapping 
DNA,  and  the  decrease  in  the  2-AP  signal  was  followed 
(Figure  6).  The  dissociation  rate  constants  are  reported  in 
Table  1  and  are  consistent  with  the  y-intercepts  (k„nj  esti¬ 
mated  from  the  concentration  dependence  of  the  association 
rates  in  Figure  5A,B.  The  dissociation  rates  were  strongly 
dependent  on  the  nature  of  the  U'  -X  base  pair,  decreasing 
by  over  4  orders  of  magnitude  over  the  entire  series,  with 
koff(UF-D)  =  44  s-1  and  koff(UF-M)  =  0.018  s~\  These  data 
provide  strong  evidence  that  removal  of  target  base  pair 
hydrogen  bonds  leads  to  profound  stabilization  of  one  or 
more  enzyme-bound  species  on  the  uracil  flipping  pathway. 
Since  the  kinetic  effect  of  removing  hydrogen  bonds  appears 
very  late  during  the  uracil  flipping  process  (i.e.,  during  the 


3  We  were  unable  to  detect  DNA  bending  by  UDG  employing 
standard  fluorescence  resonance  energy  transfer  (FRET)  methods  and 
DNA  substrates  with  fluorescence  donor  and  acceptor  groups  on  each 
end  (51,  52).  We  attribute  this  result  to  interaction  of  the  rhodamine 
fluorophore  with  binding  sites  in  the  free  DNA  (55,  54),  which  may 
be  disrupted  when  the  enzyme  binds,  resulting  in  no  FRET  change. 
Based  on  crystallographic  models  of  bent  DNA  bound  to  UDG  (not 
shown),  we  would  have  expected  a  substantial  1.4-fold  FRET  increase. 
The  same  negative  result  was  obtained  when  the  donor  or  acceptor 
was  placed  on  the  3'  or  5'  end  of  the  uracil-containing  strand  or  when 
a  15-mer  or  19-mer  duplex  was  used.  We  note  that  an  unexpected  and 
small  15%  FRET  decrease  was  recently  reported  for  EcoRI  methyl- 
transferase  which  also  flips  a  DNA  base  by  a  putative  bending 
mechanism,  although  an  increase  was  expected  (55).  We  conclude  that 
FRET  results  should  be  interpreted  cautiously  in  base  flipping  systems. 


Figure  7 :  Logarithmic  dependences  of  kon  and  kD ff  on  the  binding 
affinity  of  UDG  for  UF-X  DNA  (pA'o).  (A)  Logarithmic  dependence 
of  kon  on  p Ko-  The  data  were  fit  to  the  empirical  equation  log  fcon 
=  log  ki iff  —  log(l  +  10pA°~pJG),  where  k^ iff  is  the  rate  constant  for 
diffusional  encounter  and  p K0  is  an  empirical  constant.  (B)  Linear 
dependence  of  log  fc0ff  on  binding  affinity  (slope  =  0.74  ±  0.07, 
R2  =  0.97).  For  reference,  the  number  of  hydrogen  bonds  in  each 
destabilized  UF-X  base  pair  is  indicated  above  each  data  point. 

last  conformational  step  monitored  by  tryptophan  fluores¬ 
cence;  see  above),  then  the  stabilized  species  must  be  one 
or  both  of  the  complexes  shown  in  the  upper  and  lower  right- 
hand  corners  of  Figure  IB  (I2,  P). 

Linear  Free  Energy  Correlations.  The  quantitative  de¬ 
pendence  of  kon  and  k,M  on  UDG  binding  affinity  provides 
further  insights  into  the  mechanistic  basis  for  the  enhanced 
binding  of  duplexes  with  destabilized  base  pairs.  A  plot  of 
log  kon2AP  versus  p AT d  is  nearly  flat  across  the  whole  series 
(Figure  7A,  triangles)  with  an  average  value  kon  —  2.8  x 
10s  M-1  s_)  indicative  of  near  diffusion-controlled  binding 
except  for  the  substrate  with  the  most  stable  base  pair 
(UF,D)  which  falls  slightly  below  this  average  value.  Thus, 
all  internal  kinetic  steps  preceding  and  including  the  2-AP 
fluorescence  change  are  rapid  as  compared  to  the  encounter 
rate.  In  contrast,  k,„f'v  shows  a  linear  increase  as  a  function 
of  pXD  for  the  more  stable  UF,D  and  UF,A  duplexes, 
followed  by  downward  curvature  to  a  plateau  value  of  konTrp 
=  4  x  10s  M-1  s-1  for  the  most  destabilized  base  pairs 
(Figure  7A,  squares).  This  behavior  indicates  a  change  in 
rate-limiting  step  as  a  function  of  base  pair  stability  from 
the  step  giving  rise  to  the  tryptophan  fluorescence  change 
to  that  involving  diffusion-controlled  encounter.  We  conclude 
that  the  thermodynamic  stability  of  the  IT‘X  base  selectively 
alters  the  activation  barrier  for  the  step  reported  by  k,„frp. 

In  contrast  with  log  kon,  there  is  a  linear  dependence  of 
log  kQff  on  pXn,  with  the  dissociation  rate  decreasing  steeply 
as  UDG  affinity  increases  (Figure  7B).  From  the  slope  of 
this  correlation  (—0.74  ±  0.07),  it  can  be  inferred  that  about 
74%  of  the  difference  in  binding  energies  between  any  two 
Ul  -X  duplexes  arises  from  k0ff.  The  simplest  mechanism  that 
accounts  for  the  small  effect  of  base  pair  strength  on  kou, 
and  the  large  effect  on  koff,  is  the  progressive  stabilization 
of  one  or  more  enzyme-bound  species  as  hydrogen  bonds 
are  removed  (see  Discussion). 
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Table  2:  Effect  of  Pyrene  Nucleoside  on  Binding  Affinity  of  UDG 
to  Uracil  and  the  Abasic  Site  Containing  DNA“ 


Duplex  DNA 


Kd  (nM) 
300  ±  50 

120  ±  30 

0.24  ±  0.03 
100  ±  10 

8700  ± 675 
18  ±3.0 


°  The  sequence  of  DNA  corresponds  to  15-mer  duplexes  in  ref  14. 
<!>  =  tetrahydrofuran  abasic  nucleotide  analogue. 


Base  Pair  Hydrogen  Bonds,  DNA  Flexibility,  and  UDG 
Binding.  To  test  the  role  of  DNA  flexibility  on  binding 
affinity,  we  employed  a  novel  structure— reactivity  approach. 
This  strategy  takes  advantage  of  the  decrease  in  flexibility 
of  the  target  site  when  it  is  modified  with  a  rigid  pyrene 
nucleotide  on  one  strand  (Table  2)  (77,  30,  34—36).  We 
originally  used  a  uracil— pyrene  base  pair  (UF-Y)  to  preor¬ 
ganize  uracil  in  an  extrahelical  conformation  (34),  with  the 
rationale  that  the  planar  aromatic  ring  structure  of  pyrene 
would  fully  encompass  the  entire  volume  normally  occupied 
by  the  standard  U1- A  base  pair,  thereby  presenting  the  uracil 
to  the  enzyme  in  an  extrahelical  conformation  that  favors 
binding  (i.e.,  the  energetic  penalty  for  uracil  flipping  is 
prepaid,  allowing  for  tighter  binding).  In  accord  with  this 
reasoning,  DNA  that  contained  a  UF*Y  pair  fully  rescued 
the  catalytic  and  base  flipping  activities  of  UDG  mutants 
that  were  incapable  of  pushing  uracil  from  the  DNA  base 
stack  in  the  context  of  a  UF,A  base  pair  (77,  34). 

Although  base  flipping  mutants  showed  dramatic  increases 
in  their  site-specific  binding  affinity  and  catalytic  activity 
when  a  UF*Y  pair  was  used,  wild-type  UDG  bound  with 
nearly  equal  affinity  to  DNA  containing  either  a  U'  -Y  or 
UF*A  base  pair  (34).  This  finding  is  reaffirmed  in  Table  2 
where  the  binding  affinity  of  UDG  to  a  15-mer  duplex 
containing  a  U'  -A  and  U'  -Y  pair  is  shown  to  differ  by  only 
2.5-fold.  This  result  is  in  dramatic  contrast  to  the  1200-fold 
tighter  binding  of  the  UF-M  duplex  as  compared  to  U'  -A 
using  the  same  15-mer  context  (Table  2).  These  observations 
indicate  that  there  is  a  cryptic  energetic  penalty  for  binding 
of  the  UF-Y  duplex  that  is  not  present  for  binding  of  the 
U'-M  duplex  even  though  U'-M  and  U'-Y  both  lack 
hydrogen  bonds  to  the  uracil  base.  Thus  a  key  question  is 
the  basis  for  the  dramatically  tighter  binding  of  the  UF,M 
duplex  as  compared  to  UF,Y  (AAG  =  3.7  kcal/mol). 

A  likely  explanation  is  that  the  pyrene  nucleoside  serves 
to  stiffen  the  DNA  at  the  apex  of  the  bend  induced  during 
base  flipping  and  thus  requires  a  greater  expenditure  of 


binding  energy  by  UDG  in  order  to  distort  the  DNA  as 
compared  to  UF,M.  This  idea  is  consistent  with  previous 
computational  findings  suggesting  that  decreased  DNA 
flexibility  can  lead  to  reduced  UDG  binding  affinity  by 
increasing  the  resistance  of  the  DNA  to  adopt  the  bend 
enforced  by  protein  binding  (10).  Increased  structural  rigidity 
introduced  by  pyrene  is  chemically  reasonable  because  the 
extended  Jt  aromatic  system  of  the  pyrene  base  pair  mimic 
is  much  more  rigid  than  the  native  UF,A  base  pair  which  is 
only  restrained  by  noncovalent  hydrogen  bonding.  Assuming 
this  interpretation  for  binding  of  the  UF,Y  base  pair,  its 
unfavorable  rigidity  is  offset  by  the  energetic  benefit  of 
extrahelical  preorganization  of  the  uracil.  Thus,  the  binding 
affinity  of  U,  -Y  to  UDG  is  similar  to  a  Ul  -A  base  pair 
because  of  compensatory  energetic  effects,  but  UF-M  binds 
more  tightly  because  of  the  increased  flexibility  of  the  site 
(Table  2). 

To  further  test  the  proposal  that  rigidity  of  the  DNA  has 
an  adverse  effect  on  UDG  binding  affinity,  we  tested  the 
effect  of  the  opposing  base  (A,  M,  or  Y)  on  binding  of  DNA 
that  contained  an  abasic  site  product  analogue  ( <I> )  (Table 
2).  Structural  studies  have  shown  that  UDG  can  bind 
specifically  to  and  flip  out  abasic  sites  to  form  a  structurally 
indistinguishable  complex  as  compared  to  substrate  analogue 
DNA  (2,  8,  37,  38).  However,  because  there  is  no  require¬ 
ment  for  rupture  of  a  base  pair  upon  flipping  of  an  abasic 
nucleotide,  and  there  is  no  uracil  attached  to  form  hydrogen 
bonds  and  stacking  interactions  in  the  UDG  active  site,  the 
changes  in  binding  affinity  between  the  <b*Y  and  <3>M 
duplexes  should  largely  reflect  differences  in  the  intrinsic 
flexibility  of  the  target  site.  As  shown  in  Table  2,  <T  ■  M  binds 
''-480-fold  more  tightly  than  <b'Y  and  5-fold  more  tightly 
than  d^'A.  The  observed  greater  affinity  of  UDG  for  $*M 
over  d>-Y  is  identical  to  that  of  U'  -M  and  U'  -Y  (Table  2). 
We  conclude  from  these  results  that  pyrene  introduces  an 
unfavorable  free  energy  contribution  to  binding  of  as  much 
as  3.7  kcal/mol  arising  from  its  increased  rigidity  and 
stacking  as  compared  to  sites  that  contain  A  or  M  as  the 
opposing  base.  In  conclusion,  the  observation  that  flexible 
abasic  sites  bind  with  much  higher  affinity  than  rigid  abasic 
sites  provides  strong  experimental  evidence  that  specific 
recognition  involves  a  flexibility  component  regardless  of 
the  presence  of  a  uracil  base. 

DISCUSSION 

The  mechanism  by  which  enzymes  obtain  extraordinary 
specificity  for  their  respective  target  sites  in  a  large  back¬ 
ground  of  random  DNA  sequences  has  long  been  an  active 
area  of  research.  In  this  general  area,  DNA  glycosylases  have 
evolved  a  unique  solution  to  this  problem  that  is  divergent 
from  the  sequence-dependent  mechanisms  of  restriction 
enzymes,  transcription  factors,  and  repressor  proteins.  Ob¬ 
taining  a  fundamental  understanding  of  the  kinetic  and 
thermodynamic  origins  of  DNA  repair  glycosylase  specificity 
has  the  potential  to  allow  rational  targeting  of  these  enzymes 
to  engineered  DNA  sites  that  display  features  that  promote 
enhanced  binding  and  reactivity.  Indeed,  we  have  previously 
shown  that  simple  rules  derived  from  mechanistic  studies 
of  base  flipping  and  catalysis  can  be  used  to  alter  UDG’s 
specificity  to  recognize  a  cytosine  base  opposite  to  pyrene 
(i.e.,  a  C‘Y  base  pair)  rather  than  uracil  (39).  Such  findings 
offer  the  promise  of  altering  the  coding  sequences  of  genes 
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in  vivo  using  engineered  glycosylases  in  combination  with 
assisting  targeting  molecules  such  as  pyrene-containing 
oligonucleotides. 

The  Reaction  Coordinate  for  Base  Flipping.  It  is  difficult 
to  imagine  the  pathway  for  flipping  a  base  180°  from  the 
DNA  base  stack  ultimately  leading  to  its  precise  docking 
into  an  enzyme  active  site.  It  is  equally  difficult  to  view  how 
an  enzyme  could  form  specific  interactions  to  guide  a  target 
base  through  a  trajectory  that  arcs  widely  out  of  the  DNA 
helix  (Figure  1  A)  (7,  55).  The  present  data  suggest  plausible 
mechanistic  answers  to  these  questions  that  are  consistent 
with  previous  NMR  dynamic  and  structural  studies  of  the 
earliest  steps  in  the  base  flipping  process  (7,  55)  and  also 
crystallographic  studies  of  the  final  extrahelical  complex  (8). 

To  understand  how  an  enzyme  might  facilitate  the  overall 
base  flipping  process,  it  is  useful  to  consider  the  intrinsic 
kinetic  and  thermodynamic  problems  that  must  be  overcome 
in  extruding  a  base  from  the  B  DNA  base  stack  (Figure  8). 
First,  NMR  imino  proton  exchange  experiments  have  estab¬ 
lished  that  T*A,  T'G,  and  U*A  base  pairs  rapidly  open  at 
room  temperature  (7,  40,  41).  These  opening  rates  are  greater 
than  or  equal  to  the  rate  constants  for  kinetic  steps  on  the 
base  flipping  of  UDG  (Table  1),  indicating  that  spontaneous 
base  pair  opening  provides  kinetically  competent  motions 
for  seeding  the  enzymatic  base  flipping  process.  Indeed, 
recent  studies  of  imino  proton  exchange  of  UDG-bound  DNA 
have  established  that  UDG  does  not  alter  the  opening  rate 
but  instead  substantially  slows  the  closing  rate  constant  of 
open  T*A  base  pairs  (7).  This  result  is  intuitively  satisfying 
because  the  base  pair  closing  rates  in  B  DNA  are  exceedingly 
fast  (~107  s-1)  (42,  43),  and  one  major  problem  in  facilitating 
the  migration  of  a  base  forward  along  the  flipping  trajectory 
is  to  prevent  its  retrograde  motion  back  into  the  DNA  base 
stack  (Figure  8A).  On  the  basis  that  adjacent  T*A  and  G*C 
base  pairs  showed  identical  closing  rates  in  the  UDG 
complex  (~105  s_1),  but  these  rates  differed  by  25-fold  in 
the  free  DNA,  we  proposed  that  UDG  uses  nonspecific  DNA 
backbone  interactions  to  increase  the  lifetime  of  a  high- 
energy  open  state  of  the  bound  DNA  that  occurs  very  early 
on  the  flipping  trajectory  (Ij,  Figure  8A)  (7).  This  open  state 
of  UDG-bound  DNA  is  still  extremely  unstable  and  exists 
in  dynamic  equilibrium  with  closed  B  DNA.  Nevertheless, 
the  equilibrium  is  now  shifted  more  toward  the  open  state 
as  compared  to  the  free  DNA  (Figure  8A). 

The  next  intermediate  (I2)  on  the  flipping  pathway  has  an 
altered  B  DNA  structure  such  that  the  uracil  stacking 
interactions  with  the  adjacent  2-AP  are  perturbed,  but  on 
average,  the  base  pair  hydrogen  bonds  are  still  intact  (Figure 
8A).  The  experimental  observations  that  support  these 
features  of  I2  are  (i)  the  rapid  2-AP  fluorescence  change 
indicating  a  perturbation  in  the  stacking  of  uracil  with  2-AP 
during  the  formation  of  I2  (Figure  4A)  and  (ii)  that  the  kinetic 
step  between  I2  and  the  final  extrahelical  state  (P)  is 
kinetically  enhanced  as  hydrogen  bonds  are  removed  from 
the  U*X  base  pair  (Figure  5C),  indicating  that  these  bonds 
persist  after  formation  of  I2.  Once  the  final  state  is  attained 
and  the  uracil  is  docked  and  held  securely  in  the  active  site 
by  stacking  and  hydrogen-bonding  interactions  (8,  38,  44— 
46),  the  extrahelical  base  cannot  reenter  the  DNA  stack 
without  climbing  the  reverse  activation  barrier  back  to  I2. 
Thus,  the  base  flipping  process  may  be  viewed  as  the 
progressive  formation  of  enzyme-bound  states  in  which  the 
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uracil  base  spends  an  ever  increasing  time  in  an  extrahelical 
conformation  as  depicted  in  Figure  8A. 

The  Importance  of  Being  Flexible.  The  kinetic  and 
thermodynamic  data  indicate  that  as  the  base  pair  hydrogen 
bonds  are  removed,  the  second  intermediate  (I2),  or  the  final 
extrahelical  state  (P),  is  increasingly  stabilized  (Figure  8B). 
The  results  also  reveal  that  removal  of  the  first  two  hydrogen 
bonds  alters  binding  affinity  by  ~1  kcal/mol  per  hydrogen 
bond  but  that  removal  of  the  final  hydrogen  bond  results  in 
a  significantly  larger  4  kcal/mol  increase  in  the  binding 
affinity  (Figure  3B).  Assuming  that  the  base  pair  remains 
largely  intact  when  the  first  two  hydrogen  bonds  are  broken, 
which  is  supported  by  both  NMR  and  potassium  perman¬ 
ganate  oxidation  sensitivity  measurements  (14,  47,  48),  then 
the  1  kcal/mol  free  energy  increment  may  provide  an  estimate 
of  the  individual  hydrogen  bond  enthalpy  (55).  The  free 
energy  change  when  the  last  bond  is  removed  is  substantially 
greater  than  estimates  of  DNA  base  hydrogen  bond  energy 
(55, 49),  indicating  the  loss  of  multiple  energetic  interactions, 
perhaps  including  stacking  of  the  uracil  with  adjacent  bases. 
We  suggest  that  one  ramification  of  total  ablation  of 
hydrogen  bonds  in  the  U'  -X  pair  is  increased  conformational 
flexibility  of  the  target  site.  Such  conformational  flexibility 
would  be  expected  to  facilitate  DNA  bending  and,  therefore, 
binding  affinity.  The  conclusion  that  hydrogen  bond  ablation 
increases  target  site  flexibility  is  further  supported  by  the 
3.7  kcal/mol  unfavorable  effect  of  the  rigid  pyrene  (Y) 
nucleotide  on  binding  as  compared  to  4-methylindole  (M) 
when  these  nucleotides  are  placed  opposite  to  uracil  or  an 
abasic  site  in  duplex  DNA  (Table  2).  We  infer  that  pyrene, 
by  forming  strong  stacking  interactions  and  occupying  a 
space  that  spans  the  width  of  the  DNA  duplex,  significantly 
increases  the  resistance  of  the  DNA  toward  bending,  which 
is  in  turn  reflected  in  the  binding  affinity  measurements.  Of 
course,  the  other  destabilized  base  pairs  may  also  exhibit 
changes  in  stacking  energetics,  but  these  differences  are 
expected  to  be  small  given  the  very  conservative  changes  in 
the  nature  of  the  opposing  base  (50).  These  differences  in 
stacking  energies,  if  present,  would  also  be  manifested  in 
the  flexibility  of  the  DNA  and  would  therefore  contribute 
to  the  energetics  of  enzyme-induced  DNA  bending. 

A  free  energy  diagram  depicting  the  significant  effect  of 
target  site  flexibility  on  binding  is  shown  in  Figure  8B.  The 
diagram  highlights  how  weaker  U'  -X  base  pairs  are  easier 
to  bend  due  to  their  increased  conformational  flexibility, 
leading  to  a  decrease  in  the  free  energy  of  the  bent  DNA, 
reflected  in  the  stability  of  I2  and  P  in  Figure  8B.4  In  addition 
to  the  energetic  benefit  of  increased  flexibility  of  the  U'  -M 
substrate  (~3  kcal/mol),  the  P  state  realizes  the  additional 
energetic  benefit  of  forming  hydrogen  bonds  with  the  uracil 
base  without  the  penalty  of  breaking  hydrogen  bonds  in  the 
base  pair  (~3  kcal/mol). 

The  data  uncover  that  a  significant  portion  of  UDG’s 
intrinsic  binding  energy  is  used  to  drive  the  unfavorable 
process  of  DNA  bending.  Thus,  a  key  question  is  the  benefit 


4  This  is  similar  to  our  previous  conclusion  that  enthalpically 
destabilizing  the  uracil  base  pair  promotes  enhanced  binding  by 
increasing  the  probability  of  extrahelical  states  that  are  recognized  by 
the  enzyme  (14).  However,  our  previous  interpretation  did  not  include 
the  effect  of  base  pair  flexibility  on  the  energetics  of  DNA  bending. 
Indeed,  increased  flexibility  appears  to  be  more  important  than  the 
enthalpic  benefit  of  removing  hydrogen  bonds. 
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Figure  8:  Free  energy  diagrams  depicting  the  increasing  base  flipping  equilibrium  in  each  intermediate  along  the  reaction  coordinate  for 
uracil  flipping  and  the  effect  of  base  pair  destabilization  on  binding  affinity  of  UDG.  (A)  Free  energy  diagrams  depicting  the  increasing 
equilibrium  constant  for  uracil  flipping  along  the  reaction  coordinate.  The  equilibrium,  Kn\p  =  [open]/[closed],  describes  uracil  in  a  closed, 
hydrogen-bonded  state  in  the  DNA  duplex  and  an  open  state  that  is  further  along  the  reaction  coordinate  for  base  flipping.  This  flipping 
equilibrium  is  qualitatively  depicted  for  the  free  DNA,  the  two  enzyme-bound  intermediates  in  Figure  IB  (I|,  I2),  and  the  fully  extrahelical 
product  of  the  flipping  reaction  (P).  The  known  structural  and  dynamic  features  of  each  depicted  species  are  listed  below  each  profile.  The 
equilibrium  constant  A'mp  has  been  measured  for  the  free  DNA  (~10~5)  and  for  DNA  bound  in  Ii  (~10-3)  using  NMR  solvent  magnetization 
transfer  methods  at  10  °C  (7),  establishing  that  UDG  increases  the  equilibrium  by  about  100-fold  in  this  early  intermediate.  Less  is  known 
about  the  structure  of  I2,  but  the  present  results  indicate  DNA  bending  and  largely  intact  base  pair  hydrogen  bonds.  In  the  P  state,  the  uracil 
is  held  tightly  in  the  active  site,  and  the  equilibrium  is  pushed  all  the  way  to  the  fully  open  state  (180°  rotation).  Access  to  the  closed  state 
requires  passage  back  over  the  transition  state  connecting  I2  and  P.  The  estimated  rotation  angles  for  the  closed  and  open  states  are  based 
on  the  pseudodihedral  reaction  coordinate  defined  by  Banavali  and  MacKerell  (11,  14),  which  ranges  from  10°  for  uracil  in  B  DNA  to  180° 
for  full  rotation  of  the  base  out  of  the  major  groove.  (B)  Free  energy  effect  of  base  pair  destabilization  on  binding  affinity  of  UDG  (AAGbmd 
=  AAGflex  +  AAG311501111).  The  relative  free  energy  profiles  for  the  stable  UF-D  duplex  and  the  flexible  UF-M  duplex  are  depicted.  The 
increased  flexibility  of  the  UF-M  substrate  decreases  the  free  energy  of  I2  because  less  binding  energy  is  required  to  bend  the  DNA  at  this 
step.  The  magnitude  of  the  flexibility  benefit  is  estimated  as  AAGflex  ~  3  kcal/mol  (see  text).  In  addition  to  the  flexibility  benefit,  the 
absence  of  base  pair  hydrogen  bonds  in  UF‘M  as  compared  to  the  three  hydrogen  bonds  in  UF-D  enhances  binding  of  the  P  state  by 
AAGHbond  ~  3  kcal/mol  (see  Figure  3B  and  text).  This  benefit  arises  because  for  UF-M  there  is  no  energetic  price  for  breaking  these  bonds 
in  the  base  pair.  For  simplicity  we  have  shown  that  AAGfiex  arises  entirely  at  the  I2  state,  but  this  effect  is  more  likely  realized  at  both  the 
I2  and  P  states. 


derived  from  this  significant  energetic  expenditure.  Since  the 
rate  of  spontaneous  base  pair  opening  is  already  fast  com¬ 
pared  to  events  on  the  base  flipping  pathway,  it  is  mecha¬ 
nistically  reasonable  to  propose  that  the  role  of  enzyme- 
induced  DNA  bending  is  to  promote  forward  commitment 
along  the  pathway  rather  than  to  lower  activation  barriers  to 
flipping  (i.e.,  stabilization  of  extrahelical  intermediates).5 


UDG  likely  promotes  forward  migration  of  the  uracil  down 
the  pathway  by  forming  nonspecific  interactions  with  the 


5  Computational  studies  on  UDG  and  other  enzymes  have  suggested 
that  DNA  bending  facilitates  base  flipping  (9—12,  51—55),  and  a  recent 
computational  study  on  Hhal  (cytosine-C5)-methyltransferase  indicates 
that  this  enzyme  acts  by  forming  a  web  of  interactions  with  several 
high-energy  intermediates  along  the  flipping  pathway  (56). 


5958  Biochemistry,  Vol.  44,  No.  16,  2005 


Krosky  et  al. 


DNA  backbone  which  become  progressively  stronger  during 
the  formation  of  two  discrete  intermediates  (Ii  and  I2,  Figure 
IB).  In  the  earliest  nonspecific  intermediate  (Ii),  NMR  imino 
exchange  experiments  indicate  that  UDG  increases  the 
opening  equilibrium  ([open]/[closed])  by  almost  100-fold  (2.8 
kcal/mol),  largely  by  decreasing  the  closing  rate  (7).  In  the 
subsequent  intermediate  (I2),  the  magnitude  of  the  equilib¬ 
rium  is  currently  unknown,  but  a  further  increase  is  suggested 
because  the  uracil  has  substantially  decreased  stacking 
interactions  with  the  adjacent  2-AP  base  arising  from  DNA 
bending.  Since  the  presence  of  uracil  is  required  to  achieve 
I2,  then  this  intermediate  must  be  considered  a  specific 
complex  in  which  the  enzyme  recognizes  some  feature  of 
the  uracil  base.  The  extent  or  nature  of  the  discrimination 
between  uracil  and  other  normal  DNA  bases  at  this  inter¬ 
mediate  step  is  currently  not  known.  In  conclusion,  our  view 
based  on  these  findings  is  that  UDG  facilitates  base  flipping 
by  utilizing  binding  energy  to  alter  the  DNA  structure, 
thereby  progressively  shifting  the  flipping  equilibrium  toward 
the  open  state  that  is  ultimately  trapped  by  specific  interac¬ 
tions  located  within  the  uracil  binding  pocket.  The  specific 
interactions  are  formed  only  very  late  in  the  process,  and 
thus,  the  earlier  steps  in  the  flipping  pathway  are  largely 
driven  by  nonspecific  interactions.  A  further  experimental 
goal  with  UDG  is  to  unambiguously  identify  the  amino  acid 
side  chains  that  promote  the  stepwise  transition  from  the 
closed  to  open  state. 

Conclusion.  We  have  uncovered  the  origins  of  high-affinity 
binding  of  UDG  to  an  extrahelical  uracil  base.  By  discretely 
altering  base  pair  hydrogen  bonding  and  site  flexibility,  we 
have  found  that  high  binding  affinity  is  determined  more  by 
the  flexibility  of  the  site  than  by  the  enthalpic  benefit  of 
removing  base  pair  hydrogen  bonds.  Thus  a  significant 
contribution  to  enzymatic  base  flipping  is  the  unfavorable 
energetic  cost  of  DNA  bending.  This  cost  is  estimated  in 
two  ways:  (i)  the  stiffening  effect  of  pyrene  (3.7  kcal/mol) 
and  (ii)  the  additional  favorable  energetic  effect  on  binding 
of  removing  the  last  hydrogen  bond  from  the  uracil  base 
pair  (~3  kcal/mol).  In  the  future,  these  simple  features  of 
high-affinity  target  site  binding  by  UDG  may  be  utilized  to 
direct  UDG  to  specific  sites  in  genomic  DNA. 
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Abstract:  Uracil  DNA  glycosylase  (UNG)  is  an  important  DNA  repair  enzyme  that  recognizes  and  excises 
uracil  bases  in  DNA  using  an  extrahelical  recognition  mechanism.  It  is  emerging  as  a  desirable  target  for 
small-molecule  inhibitors  given  its  key  role  in  a  wide  range  of  biological  processes  including  the  generation 
of  antibody  diversity,  DNA  replication  in  a  number  of  viruses,  and  the  formation  of  DNA  strand  breaks 
during  anticancer  drug  therapy.  To  accelerate  the  discovery  of  inhibitors  of  UNG  we  have  developed  a 
uracil-directed  ligand  tethering  strategy.  In  this  efficient  approach,  a  uracil  aldehyde  ligand  is  tethered  via 
alkyloxyamine  linker  chemistry  to  a  diverse  array  of  aldehyde  binding  elements.  Thus,  the  mechanism  of 
extrahelical  recognition  of  the  uracil  ligand  is  exploited  to  target  the  UNG  active  site,  and  alkyloxyamine 
linker  tethering  is  used  to  randomly  explore  peripheral  binding  pockets.  Since  no  compound  purification  is 
required,  this  approach  rapidly  identified  the  first  small-molecule  inhibitors  of  human  UNG  with  micromolar 
to  submicromolar  binding  affinities.  In  a  surprising  result,  these  uracil-based  ligands  are  found  not  only  to 
bind  to  the  active  site  but  also  to  bind  to  a  second  uncompetitive  site.  The  weaker  uncompetitive  site  suggests 
the  existence  of  a  transient  binding  site  for  uracil  during  the  multistep  extrahelical  recognition  mechanism. 
This  very  general  inhibitor  design  strategy  can  be  easily  adapted  to  target  other  enzymes  that  recognize 
nucleobases,  including  other  DNA  repair  enzymes  that  recognize  other  types  of  extrahelical  DNA  bases. 


Introduction 

DNA  repair  pathways  have  been  traditionally  viewed  as  the 
cellular  quality  control  machinery  that  preserves  the  coding 
potential  of  genomes.1 2 3 4  However,  there  is  emerging  recognition 
that  the  repair  mechanisms  evolved  to  prevent  accumulation  of 
the  RNA  base  uracil  in  DNA  play  a  much  broader  role  in  a 
number  of  important  areas  of  biomedicine  that  are  divergent 
from  genome  preservation.  Remarkable  examples  include  the 
role  of  the  uracil  excision  repair  machinery  in  the  process  of 
generating  genetic  diversity  during  antibody  maturation  in  B 
cells,2-4  the  importance  of  uracil  incorporation  and  removal  in 
the  life  cycles  of  herpes,5  cytomegalo,6  pox,7’8  and  type  1  human 
immunodeficiency  viruses  (HIV-1),9 10 *  and  the  essential  role  of 
this  pathway  in  generating  pharmacologically  active  single  and 
double  strand  DNA  breaks  during  chemotherapy  treatment  with 
5-flurouracil  and  methotrexate.1011  The  key  enzyme  player  in 
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all  of  these  remarkably  diverse  processes  is  uracil  DNA 
glycosylase  (UNG),  which  cleaves  the  glycosidic  bond  between 
the  uracil  base  and  the  deoxyribose  sugar  in  DNA  by  flipping 
the  uracil  nucleotide  from  the  DNA  duplex  into  the  enzyme 
active  site  (Figure  1A).12  Given  that  UNG  is  emerging  as  a  very 
interesting  pharmacologic  target,  we  have  sought  out  methods 
for  the  rapid  and  efficient  identification  of  small-molecule 
ligands  that  could  inhibit  its  activity.  Although  potent  nucleic 
acid-based  and  proteinaceous  inhibitors  are  available  that  target 
UNG,13-17  there  are  no  small-molecule  inhibitors  for  this 
enzyme,  and  strategies  for  the  discovery  of  such  ligands  are 
lacking. 

One  of  the  most  exciting  potential  applications  of  small- 
molecule  human  UNG  inhibitors  are  as  antiretroviral  agents. 
Recent  findings  have  established  that  HIV-1  specifically 
packages  human  UNG  (hUNG)  into  virus  particles  via  inter¬ 
action  with  the  virus  encoded  integrase  protein  (Int)  or  per¬ 
haps  a  ternary  complex  between  UNG,  Int,  and  the  viral  Vpr 
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Figure  1.  Extrahelical  binding  of  uracil  to  the  UNG  active  site  and  the 
general  strategy  for  uracil-directed  ligand  tethering.  (A)  Structure  of  UNG 
bound  to  uracil  is  shown  (pdb  code  2eug).  The  residue  numbering  is  for 
the  human  enzyme.  (B,  C)  The  uracil  ligand  (U)  that  targets  the  UDG  active 
site  is  covalently  tethered  to  two  different  ligands  that  can  interact  with 
distinct  binding  surfaces  near  the  active  site. 


protein. 5>18_25  hUNG  is  required  for  infection  of  nondividing 
cells  such  as  macrophages  and  resting  T  cells  and  helps  maintain 
a  viral  reservoir  in  the  host  that  is  crucial  for  virus  spread  to 
the  lymphoid  organs  and  T-helper  lymphocytes  and,  ultimately, 
AIDS  pathogenesis.20-26  UNG  is  apparently  recruited  to  mini¬ 
mize  uracil  incorporation  into  the  viral  genome  in  these  cells, 
which  have  naturally  high  levels  of  dUTP,  a  good  substrate  for 
the  viral  reverse  transcriptase.27  In  the  absence  of  UNG,  the 
HIV-1  mutation  rate  is  found  to  increase  by  18-fold  resulting 
in  extremely  inefficient  virus  replication  in  nondividing  cells,20 
and  the  virus  particles  produced  from  UNG-depleted  cells  are 
incapable  of  infecting  new  target  cells.9’28  Pharmacologic  target¬ 
ing  of  a  human  enzyme  required  for  virus  infectivity  is  extremely 
attractive  because  such  a  target  would  not  be  susceptible  to  the 
same  high  mutagenesis  rate  and  resulting  drug  resistance  as  viral 
encoded  proteins.29  Targeting  the  human  enzyme  is  a  viable 
therapeutic  strategy  because  it  is  not  an  essential  enzyme.  Thus, 
UNG  knock-out  mice  display  no  remarkable  phenotype,  nor  do 
UNG  null  yeast  or  human  cell  lines.30 
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Herein,  we  report  an  integrated  high-throughput  (HTP) 
platform  for  discovering  small-molecule  ligands  that  inhibit 
UNG.  The  strategy  takes  advantage  of  the  extrahelical  uracil 
recognition  mechanism  of  UNG  by  using  the  specificity  and 
binding  energy  of  a  uracil  ligand  to  target  the  UNG  active 
site14,31,32  and  then  covalent  tethering  of  random  functional 
groups  for  exploration  of  nearby  binding  pockets  (Figure  IB). 
Library  members  can  be  rapidly  screened  using  a  robust  HTP 
activity  assay,  and  initial  hits  are  quickly  optimized  using 
subsequent  structure— activity  studies.  This  tethering  approach, 
which  uses  efficient  oxime  chemistry  (Figure  2),  is  related  to 
the  “combinatorial  target-guided  ligand  assembly”  method  of 
Ellman  et  al.33  but  differs  in  that  the  uracil  ligand  specifically 
targets  the  active  site  rather  than  irrelevant  regions  of  the 
enzyme.  Thus,  the  hit-rate  and  binding  affinities  of  early  hits 
are  higher  than  the  more  random  approach  of  Ellman  and 
colleagues.  This  synthetic  and  screening  strategy  should  be 
easily  adaptable  for  the  discovery  of  inhibitors  of  other  enzymes 
that  recognize  extrahelical  bases  in  DNA  or  free  nucleosides. 

Results  and  Discussion 

Synthesis  of  Uracil-Tethered  Oxime  Libraries  and  Gen¬ 
eral  Strategy.  We  sought  an  inhibitor  development  strategy  that 
allowed  rapid  and  economical  synthesis  of  small-molecule 
ligands  that  explore  binding  sites  near  the  UNG  active  site  and 
which  could  be  used  directly  in  HTP  screening  applications 
without  purification.  One  efficient  synthesis  strategy  that  meets 
these  criteria  is  outlined  in  Figure  2.  First,  flexible  diaminoal- 
kanediol  linkers  of  variable  length  are  synthesized  from  the 
corresponding  dibromoalkanes  (Figure  2A).  Then  the  linkers 
are  used  to  tether  uracil  aldehyde  binding  elements  (1—3)  to  a 
library  of  aldehyde  binding  elements  (RCHO)  via  the  formation 
of  stable  oxime  linkages  (Figure  2B).  Each  tethering  reaction 
is  carried  out  in  one  well  of  a  96-well  microtiter  plate  that 
contains  one  equivalent  uracil  aldehyde,  one  equivalent  RCHO 
library  member,  and  a  mixture  of  diaminoalkanediol  linkers  (n 
=  2—6).  The  reactions  typically  proceed  to  85—99%  completion 
after  overnight  incubation  (DMSO  solvent,  37  °C)  and  produce 
a  1:2:1  statistical  mixture  of  the  homodimeric  (UaU,  RaR)  and 
heterodimeric  (UaR)  oximes  for  each  of  the  five  linker  lengths 
present  (see  Experimental  Section  and  Supporting  Imformation 
Figure  SI).  Although  two  geometric  configurations  are  possible, 
oxime  derivatives  with  bulky  substituents  are  generally  found 
to  be  >95%  in  the  trans  configuration.34  The  unpurified  oxime 
mixtures  were  directly  screened  for  inhibition  of  UNG  at  ~  100 
«M  total  oxime  concentration  to  ensure  that  each  component 
in  the  mixture  is  present  at  a  concentration  in  the  range  5—10 
/<M.  If  significant  inhibition  is  observed  by  any  mixture,  the 
linker  length  and  RCHO  binding  element  that  gave  rise  to  the 
inhibition  can  be  identified  by  resynthesis  of  the  individual 
oximes  using  a  single  linker  length  in  each  reaction  (see  below). 

An  important  aspect  of  this  approach  is  that  the  uracil 
homodimers  present  in  some  reaction  mixtures  are  inhibitory 
even  in  the  absence  of  any  active  heterodimer.  For  instance. 
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Figure  2.  Synthesis  of  oxime  libraries  based  on  uracil  and  RCHO:  (A)  synthesis  of  diaminoalkanediol  tethers  of  variable  length;  fB)  construction  of  the 
uracil— oxime  library  based  on  the  uracil  aldehydes  (1—3)  and  a  series  of  aldehyde  compounds  (RCHO).  The  products  consist  of  a  1:2:1  mixture  of  the 
heterodimer  (UaR)  and  the  two  homodimers  (UaU  and  RaR)  connected  via  alkane  linkers  of  lengths  2—6.  A  1  equiv  amount  of  total  diaminoalkanediol 
is  added  to  each  reaction.  Each  linker  length  is  present  at  one-fifth  of  the  total  concentration. 


the  purified  homodimers  of  various  lengths  that  are  based  on 
6-formyluracil  (3)  give  rise  to  about  22%  inhibition  in  all  the 
mixtures  based  on  3  under  the  screening  conditions  (not  shown). 
In  contrast,  the  homodimers  of  1  and  2  show  no  detectable 
inhibition  under  the  same  conditions.  Thus,  the  screening  assay 
must  be  robust  enough  to  detect  any  additional  inhibition 
resulting  from  an  active  heterodimer  in  the  mixture.  Spectro¬ 
scopic  results  for  determining  the  purity  and  composition  of 
representative  reaction  mixtures  are  available  (see  Supporting 
Information). 

High-Throughput  Screening  of  Uracil— Oxime  Libraries. 

To  test  this  directed  library  approach,  we  tethered  the  three  uracil 
aldehydes  (1—3)  shown  in  Figure  2  to  14  aldehyde  binding 
elements  (RCHO)  using  the  variable-length  diaminoalkanediol 
linkers  (see  Supporting  Information  Table  SI  for  RCHO 
structures).  This  library  of  uracil-linked  binding  elements  was 
screened  for  inhibition  of  hUNG  using  a  high-throughput 
molecular  beacon  activity  assay  (Figure  3). 35  In  this  assay,  one 
DNA  strand  is  labeled  with  a  fluorescent  5'-FAM  and  the 
complementary  strand  is  modified  with  a  3'-dabsyl  moiety  that 
serves  to  efficiently  quench  the  fluorescence  of  the  FAM  group 
through  contact  quenching.  To  increase  stability,  the  two  DNA 
strands  are  linked  in  a  hairpin  configuration  using  an  18  atom 
poly(ethylene  glycol)  linker.  When  the  substrate  DNA  is 
exposed  to  UDG,  multiple  uracils  are  removed,  and  eventually 
the  two  paired  strands  of  the  hairpin  spontaneously  separate, 
thus  removing  the  dabsyl  quencher  from  the  proximity  of  the 
FAM  group  and  resulting  in  a  6-fold  increase  in  the  fluorescence 
of  the  system  (Figure  3A).  Under  the  assay  conditions,  the 
hairpin  DNA  substrate  has  a  Km  =  164  ±  10  nM  and  kcat  = 
0.33  ±  0.01  s_1  (Figure  3B).  To  enhance  detection  of  competi¬ 
tive  inhibitors  during  HTP  screening  we  employed  a  molecular 
beacon  substrate  concentration  equivalent  to  1/3 Km  (50  nM). 
Representative  HTP  screening  results  for  several  inactive  and 
active  oxime  mixtures  are  shown  in  Figure  4  ([total  oxime]  = 
100  /(M). 

(35)  Kwon,  K.;  Nagarajan,  R.;  Stivers,  J.  T.  Biochemistry  2004,  43,  14994— 
15004. 
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Figure  3.  High-throughput  (HTP)  UDG  kinetic  assay.  (A)  The  HTP  assay 
relies  on  molecular  beacon  technology.  Excision  of  multiple  uracil  bases 
by  the  enzyme  destabilizes  the  hairpin  structure  thereby  releasing  the  5'- 
FAM  fluorophore  from  the  quenching  effects  of  the  3'-dabsyl  group.  (B) 
Steady-state  kinetic  analysis  is  shown  of  the  hUDG  reaction  using  the 
molecular  beacon  hairpin  substrate. 

Several  activity  trends  emerged  immediately  from  the  screen¬ 
ing  results  shown  in  Figure  4.  First,  none  of  the  mixtures  derived 
from  the  uracil  N1 -acetaldehyde  binding  element  (1)  were 
inhibitory  at  the  concentration  used  in  the  screen.  In  addition, 


J.  AM.  CHEM.  SOC.  C 


ARTICLES 


Jiang  et  al. 


MnV* 


Figure  4.  Representative  HTP  screening  results  using  the  molecular  beacon 
substrate.  (A)  Screen  of  oxime  dimer  mixtures  derived  from  uracil  aldehyde 
1  and  aryl  aldehydes  13—17  is  presented.  No  inhibition  was  observed  for 
any  oxime  derived  from  1  regardless  of  linker  length  (n ).  (B)  Screen  of 
oxime  dimer  mixtures  derived  from  uracil  aldehyde  3  and  aryl  aldehydes 
13—17  is  shown.  The  mixed  oxime  derived  from  3  and  13  shows  significant 
inhibition,  and  this  derivative  was  further  optimized.  For  14—17,  the 
observed  inhibition  represents  that  from  the  3—3  homodimers  that  are 
present  in  the  mixtures. 

none  of  the  UaU  homodimers  derived  from  2  were  found  to 
be  inhibitory,  nor  were  any  of  the  RaR  homodimers  regardless 
of  the  linker  length.  (Inhibition  by  the  homodimers  is  automati¬ 
cally  assessed  because  these  are  present  in  multiple  reaction 
mixtures.)  In  contrast,  one  oxime  mixture  derived  from  uracil 
aldehydes  2  and  3  and  RCHO  =  3,4-dihydroxybenzaldehyde 
(13)  showed  inhibitory  activity  in  the  range  15—100%,  indicat¬ 
ing  that  active  heterodimers  were  present.  The  structures  of  the 
active  heterodimers  present  in  these  two  oxime  mixtures  are 
shown  at  the  top  of  Table  1. 

The  two  active  mixtures  were  deconvoluted  with  respect  to 
linker  length  by  individually  synthesizing  each  oxime  dimer 
using  a  single  diaminoalkanediol  linker/reaction  (Table  1).  At 
this  stage  we  did  not  separate  the  homodimers  from  the  active 
heterodimers  in  the  mixtures.  For  the  oxime  dimers  derived  from 
5-formyluracil  (2)  and  3,4-dihydroxybenzaldehyde  (13),  a  broad 
dependence  on  linker  length  was  observed  with  length  n  —  2 
being  most  favorable  for  inhibitory  activity  (i.e.  mixed  oxime 
2-(2)-13,  Chart  1).  In  contrast,  a  very  stringent  linker  length  of 
n  =  3  was  required  for  maximal  inhibitory  activity  with  the 
oxime  mixture  derived  from  6-formyluracil  (3)  and  3,4- 
dihydroxybenzaldehyde  (13)  to  form  mixed-oxime  3-(3)-13 
(Chart  1).  To  confirm  these  results,  2-(2)-13  and  3-(3)-13  were 
separated  from  their  respective  homodimers  using  reversed  phase 
HPLC  (see  Methods),  and  the  concentration  dependence  of 
inhibition  was  determined.  The  measured  IC50  values  for  2-(2)- 
13  and  3-(i)-13  were  5.8  and  1.1  u M ,  respectively  (Figure  5). 

Structure— Activity  Relationships.  In  an  effort  to  find  more 
potent  inhibitors  based  on  the  3-(.?)-13  scaffold,  25  commercially 


Table  1.  Structures  of  Active  Heterodimers  and  Dependence  of 
Inhibition  on  Linker  Length3 


2-fnJ-U  3»13 


mixture 

linker  length  (n) 

%  inhibition 

2-(n)-13 

2 

50 

3 

40 

4 

20 

5 

20 

6 

15 

3-(n)-13 

2 

57 

3 

too 

4 

51 

5 

48 

6 

48 

°  Reactions  were  performed  in  the  presence  of  100  /uM  oxime  mixture 
and  50  nM  substrate  concentration. 


Chart  1.  Heterodimer  Oximes  Identified  from  Deconvolution  of  an 
Active  Mixture 


2-0-13 


3-0-13 

available  benzaldehyde  precursors  were  purchased  (18—42;  cf. 
Supporting  Information  Table  S2).  The  HTP  screen  was  then 
performed  on  this  set  of  oxime  mixtures  (3 -(3)-R)  in  an  identical 
fashion  as  described  above.  This  structure— activity  study 
established  that  the  3-  and  4-hydroxyl  groups  of  3-(3)-13  were 
essential  for  activity  because  alkylation  or  halogen  substitution 
at  these  positions  had  a  substantial  deleterious  effect  on 
inhibitory  activity  (see  Supporting  Information  Table  S2).  Thus, 


log  [I]  (pM) 

Figure  5.  IC5o  analysis  for  2-(2)-13  (▼),  3-(S)-13  (A),  and  3-(.?)-27  (■). 
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Table  2.  Inhibitory  Activity  for  Structural  Variants  of  3-(3)-27a 


3  -(3)-R 

RCHO  = 


27  43  44  45  46 


IC50(gM)  2-R 


variant 

IC50  (urn) 

2-R 

3-(3)-27 

0.26 

OH 

3-(3)-43 

2.7 

F 

3-(3)-44 

16 

Cl 

3-(3)-45 

40 

Br 

3-(3)-46 

40 

no2 

“  The  concentration  dependence  of  inhibition  was  determined  using  50 
nM  substrate. 


hydrogen  bond  donating  groups  at  the  3-  and  4-positions  of  the 
benzyl  ring  appear  to  be  essential. 

One  compound  in  this  series  with  an  additional  hydroxyl 
group  at  the  2-position  of  the  benzyl  ring  (3-(i)-27)  showed  a 


3-(3)-27 


3-fold  greater  potency  than  3-(3)-13  (Figure  5,  ■)  (IC50  =  0.3 
/rM).  To  further  investigate  SARs  based  around  the  3-(i)-27 
scaffold,  we  synthesized  four  more  3,4-dihydroxybenzaldehyde 
analogues  (43—46,  Table  2),  where  the  substituent  at  the 
2-position  was  varied  (R  =  F,  Cl,  Br,  or  NO2).  Within  this  series 
there  was  a  strong  trend  correlating  with  atomic  size  for  the 
halogens,  with  the  smaller  fluorine  substituent  binding  16-fold 
more  tightly  than  bromine.  However,  no  substituent  in  this  series 
was  more  effective  than  the  2-hydroxyl  group.  In  conclusion, 
the  binding  pocket  for  the  2-substituent  favors  a  hydrogen  bond 
donating  group  with  a  van  der  Waals  radius  smaller  than 
chlorine. 

Inhibition  Mechanisms  of  3-(3)-27,  2-(2)-13,  and  Uracil. 

Although  the  uracil-directed  ligand  tethering  strategy  is  expected 
to  produce  competitive  inhibitors  of  UNG,  we  thoroughly 
investigated  whether  this  assumption  was  true.  The  detailed 
mode  of  inhibition  by  3-(3)-27  and  2-(2)-13  was  evaluated  by 
varying  both  substrate  and  inhibitor  concentrations  (Figure 
6A,B).  Standard  double  reciprocal  plots  of  l/fc0bsd  against 
1/[DNA]  at  increasing  concentrations  of  3-(3)-27  showed  no 
significant  intercept  effects  establishing  a  competitive  aspect 
to  the  inhibition  (Figure  6A).  However,  a  secondary  plot  of  the 
Line  weaver— Burk  slopes  against  [3-(3)-27]  showed  a  parabolic 
response  consistent  with  the  presence  of  at  least  two  inhibitor 
binding  sites  (Figure  6A,  inset).36  Global  discrimination  fitting 
of  the  inhibition  data  by  computer  simulation  with  the  program 
Dynafit  using  competitive,  noncompetitive,  uncompetitive, 
mixed-type,  two-site  competitive— noncompetitive,  and  two-site 
competitive— uncompetitive  inhibition  mechanisms  unambigu- 

(36)  Segel,  I.  H.  Enzyme  Kinetics',  John  Wiley  &  Sons:  New  York,  1993;  Chapter 
8,  pp  465-504. 
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Figure  6.  Mode  of  inhibition  analysis,  presenting  double  reciprocal  plots 
and  secondary  slope  and  intercept  replots  for  inhibition  by  increasing 
concentrations  of  (A)  3-(3)-27,  (B)  2-(2)-13,  and  (C)  uracil.  Slope  and 
intercept  effects  in  the  inset  to  (C)  are  shown  as  squares  and  triangles, 
respectively. 

ously  confirmed  the  presence  of  two  inhibitory  binding  sites 
for  3-(3)-27  (see  Supporting  Information).37  Simulations  clearly 
indictated  that  the  first  tight  site  is  competitive  with  respect  to 
substrate.  Although  the  simulations  indicated  a  slight  statistical 
advantage  for  a  partial  mixed-type  inhibition  mode  for  the 
second  weaker  site,  it  was  difficult  to  eliminate  an  uncompetitive 
mode  for  this  site.  Using  the  criterion  of  Occam’s  razor,  the 
inhibition  parameters  for  3-(3)-27  are  reported  in  Table  3  using 
the  simulation  results  for  the  competitive— partial  uncompetitive 
mechanism  (Scheme  1). 

Like  its  6-substituted  analogue,  initial  inspection  of  the 
Lineweaver— Burk  analysis  of  2-(2)-13  indicates  mixed-type 
inhibition  with  a  strong  preference  for  binding  to  the  free 
enzyme  (i.e.  slope  effects,  Figure  6B).  However,  in  contrast  to 
3-(_?)-27,  the  secondary  plot  of  the  Lineweaver— Burk  slopes 
versus  2-(2)-13  concentration  is  hyperbolic,  indicating  that 
binding  of  2-(2)-13  results  in  partial  inhibition  (Figure  6B, 
inset).36  Because  binding  to  the  active  site  would  result  in 
complete  inhibition,  2-(2)-13  most  likely  binds  to  the  noncom¬ 
petitive  site  observed  for  3-(_?)-27.  Global  discrimination  fitting 
of  the  inhibition  data  by  computer  simulation  confirmed  this 
inhibition  mechanism  (Scheme  1)  and  provided  the  inhibition 

(37)  Kuzmic,  P.  Anal.  Biochem.  1996,  237,  260-273. 
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Table  3.  Inhibition  Constants  for  Uracil  and  Its  Derivatives3 

param 

3-(3)-27 

2-(2)-13 

uracil 

51 

K \  (mM) 

0.19  ±0.02 

0.23  ±  0.03 

0.23  ±  0.02 

0.16  ±0.01 

kads-1) 

0.41  ±0.01 

0.50  ±  0.02 

0.47  ±  0.01 

0.33  ±  0.01 

feat  (s_I) 

0.16  ±0.04 

0.012  ±  0.02 

0.06  ±  0.01 

Kc  (mM) 

0.32  ±  0.02 

80  ±7 

45  ±2 

K„  (mM) 

2.8  ±0.1 

K„c  (mM) 

1.2  ±0.2 

300  ±  55 

IGT/tM) 

1  ±0.3 

125  ±  46 

104  ±7 

mode  of  inhibitn 

two  sites,  competitive. 

one  site, 

two  sites,  competitive. 

one  site, 

partial  uncompetitive 

partial  mixed-type 

partial  uncompetitive 

competitive 

°  Parameters  correspond  to  the  mechanisms  shown  in  Scheme  1 .  Kc  and  Kn  represent  dissociation  constants  for  inhibitor  binding  sites  that  are  competitive 
and  noncompetitive  with  substrate,  respectively.  Knc  and  ATns  represent  the  dissociation  constants  for  inhibitor  binding  to  the  noncompetitive  site  when  the 
active  site  is  occupied  by  the  competitively  bound  inhibitor  or  substrate,  respectively.  In  these  simulations  the  Michaelis— Menten  parameters  for  the  substrate 
were  fixed  using  values  from  nonlinear  regression  fits  (Figure  6).  Other  parameters  were  obtained  from  simulations  to  the  data  using  the  program  Dynafit 
(cf.  Supporting  Information). 


Scheme  1.  Inhibition  Mechanisms  for  3-(3)-27  and  2-(2)-13  and 
Uracil3 

_  Kc  _ 

1  +  s  +  21  +  s  +  C'V/I'?  +  i 

K"  l  A  |  !  k; 

s  +  ^  i  +  s  +  ==  '^C  +  P 

“  Only  3-(i)-27,  2-(2)-13,  and  uracil  have  mechanisms  that  include  the 
&cat'  step.  The  mechanisms  for  3-(i)-27  and  uracil  do  not  include  the 
equilibrium  constant  Kn,  and  the  mechanism  for  2-(2)-13  does  not  include 
the  equilibria  Kc  or  K„c. 

constants  reported  in  Table  3.  These  observations  strongly 
indicate  that  2-(2)-13  binds  to  a  site  distinct  from  the  active 
site,  although  DNA  binding  is  strongly  antagonistic  to  inhibitor 
binding  (Table  3).  In  summary,  the  inhibition  mechanisms  of 
3-(3)-27  and  2-(2)-13  indicate  that  two  inhibition  modes  exist 
for  these  uracil  derivatives:  one  mode  competitively  targets  the 
active  site,  and  the  second  weaker  mode  is  noncompetitive  or 
uncompetitive  with  respect  to  substrate  binding.  These  data, 
quite  surprisingly,  suggested  the  presence  of  two  uracil  binding 
sites  on  human  UNG. 

To  further  investigate  the  interesting  possibility  of  two  uracil 
binding  sites  on  UNG,  we  performed  a  mode  of  inhibition 
analysis  for  uracil  itself  (Figure  6C).  In  confirmation  of  this 
initial  expectation,  inhibition  by  uracil  involves  two  sites.  The 
first  site  is  competitive,  and  the  second  is  partially  uncompeti¬ 
tive.  Accordingly,  the  Lineweaver— Burk  slope  replot  was 
slightly  parabolic  indicating  that  inhibition  involved  binding  of 
more  than  one  molecule  of  uracil,  and  the  intercept  replot  was 
hyperbolic  indicating  a  partial  uncompetitive  mode.  These 
characteristics  of  the  inhibition  by  uracil  combine  the  features 
observed  for  3-(3)-27  and  2-(2)-13  and  establish  that  the  two 
site  binding  of  3-(5)-27  is  not  attributable  to  the  trihydroxy- 
benzaldoxime  moiety  but,  instead,  arises  from  the  uracil 
functionality  itself. 

Implications  for  Two  Uracil  Binding  Sites.  Why  would 
UNG  have  a  second  uracil  binding  site?  Although  the  answer 
to  this  question  cannot  be  firmly  established  by  inhibition  data 
alone,  an  intriguing  role  for  this  site  during  the  mechanism  of 
uracil  base  flipping  is  supported  by  several  different  experi¬ 
mental  findings.  First,  kinetic  experiments  following  the  pathway 
of  uracil  flipping  from  duplex  DNA  have  detected  a  weakly 
bound  intermediate  state  of  uracil  that  precedes  its  attainment 
of  the  final  extrahelical  state  seen  in  the  crystal  structure  (Figure 
1A).15’38-40  Solution-  and  solid-state  NMR  studies  of  uracil 
flipping  support  the  existence  of  a  weak  uracil  binding  site 


because  UNG  is  found  to  transiently  stabilize  thymine  and  other 
uracil  congeners  in  an  extrahelical  conformation,  without  these 
bases  gaining  full  access  to  the  uracil  active  site  pocket.4142 
Relevant  to  these  observations,  the  crystal  structure  of  herpes¬ 
virus  UDG  bound  to  pTTTp  shows  that  the  5'  T  is  bound  in  the 
mouth  of  the  active  site  pocket  in  a  manner  that  is  consistent 
with  a  transient  state  on  the  pathway  for  base  flipping  of  uracil.43 
Finally,  the  crystal  structure  of  another  base-flipping  DNA  repair 
enzyme,  human  8-oxoguanine  DNA  glycosylase,  suggests  that 
this  related  enzyme  can  flip  the  normal  base  guanine  into  a 
discrimination  pocket  that  was  distinct  from  the  active  site 
pocket  that  only  accommodates  8-oxoguanine.44  These  combined 
data  provide  a  compelling  case  for  a  generalized  pathway  for 
base  flipping  involving  transient  enzyme  stabilization  of  at  least 
one  extrahelical  intermediate  state  before  the  base  is  docked 
into  the  active  site.  On  the  basis  of  the  observation  that  2-(2)- 
13  is  excluded  from  the  active  site  but  that  3-(3)-27  and  uracil 
can  occupy  both  sites,  we  surmise  that  the  relative  binding 
affinities  for  each  site  might  depend  on  the  bulkiness  of  the 
substituent  at  the  5-position  of  uracil.  In  other  words,  uracil 
congeners  with  small  substituents  at  the  five  position  (such  as 
hydrogen  in  the  case  of  3-(3)-27)  would  favor  binding  to  the 
active  site  and  uracil  derivatives  with  bulkier  substituents  (such 
as  the  dihydroxybenzaldoxime  of  2-(2)-13)  would  be  sterically 
excluded  from  the  active  site  but  could  gain  access  to  the  weaker 
less  selective  site.  Indeed,  it  is  well-known  that  the  active  site 
of  UNG  uses  the  bulky  side  chain  of  a  tyrosine  to  exclude 
thymidine  (5-methyluracil),14’45-47  yet  6-substituted  uracil  de¬ 
rivatives  such  as  3-(3)-27  have  been  generally  observed  to  bind 
to  the  active  site.14  Thus,  the  uracil-based  inhibitors  found  here 
have  revealed  a  possible  pyrimidine  discrimination  site  that  may 
be  employed  during  the  multistep  extrahelical  uracil  recognition 
mechanism.  It  should  be  noted  that  the  noncompetitive  inhibition 
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J.  A.;  Krokan,  H.  E.  EMBO  J.  1996.  15,  3442-3447. 

(46)  Mol,  C.  D.;  Arvai,  A.  S.;  Slupphaug,  G.;  Kavli,  B.;  Alseth,  I.;  Krokan,  H. 
E.;  Tainer,  J.  A.  Cell  1995,  80,  869-878. 
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Chart  2.  0-Methyl  Oxime  Derivatives  of  the  Aldehyde  Binding 
Elements  of  2-(2)-13  and  3-(3)-27 


mode  for  2-(2)-13  requires  that  the  final  extrahelical  state  can 
be  attained,  albeit  inefficiently,  even  when  the  transient  uracil 
binding  site  is  occupied  by  the  inhibitor.  In  contrast,  the  partial 
uncompetitive  mechanism  for  binding  of  3-(3)-27  to  its  second 
site  does  not  present  the  same  apparent  discrepancy,  because, 
for  uncompetitive  inhibition,  the  compound  binds  after  the 
substrate  is  fully  inside  the  active  site  pocket  (see  above). 

Inhibition  by  the  Untethered  Parts.  It  is  of  interest  to  ask 
how  well  uracil-directed  ligand  tethering  has  performed.  To 
dissect  the  energetic  contributions  of  the  formyluracil  and 
hydroxybenzaldoxime  binding  elements  of  3-(i)-27  and  2-(2)- 
13,  we  synthesized  the  methyl  oxime  derivatives  of  aldehydes 
2,  3,  27,  and  13  as  shown  in  Chart  2.  These  methyl  oxime 
derivatives  are  reasonable  mimics  of  the  two  individual  binding 
elements  and  in  principle  could  provide  an  energetic  analysis 
of  the  binding  affinities  of  the  two  separate  elements.  If  the 
sum  of  the  binding  energies  of  each  element  equals  the  entire 
binding  free  energy  of  the  whole  tethered  molecule,  then  it  may 
be  concluded  that  (i)  the  tether  is  energetically  inert  with  respect 
to  binding  and  (ii)  the  binding  of  one  element  does  not  affect 
the  other  by  induced  strain  or  forcing  a  tighter  fit.  If  the  whole 
tethered  molecule  binds  much  more  weakly  or  tightly  than 
expected  from  the  summation  of  the  binding  free  energies  of 
the  two  individual  binding  elements,  then  nonadditive  energetic 
effects  are  present.  Such  effects  would  indicate  either  an 
energetic  penalty  for  tethering  (antagonistic  binding  of  the  parts) 
or,  alternatively,  a  nonadditive  energetic  benefit  (synergistic 
binding  of  the  parts).48-49 

Comparison  of  the  binding  affinity  of  3-(3)-27  to  its  competi¬ 
tive  site  (.Kc3-^-27  =  0.32  /(M)  with  that  of  the  6-formyluracil 
O-methyl  oxime  binding  element  alone  (51)  allows  estimation 
of  the  free  energy  benefit  of  tethering  the  trihydroxybenzal- 
doxime  binding  element  to  the  6-formyluracil  oxime  part. 
Conversely,  comparison  of  the  binding  affinity  of  3-(3)-27  with 
that  of  the  trihydroxybenzaldoxime  O-methyl  ether  (48)  allows 
estimation  of  the  free  energy  benefit  of  tethering  the  6-formyl¬ 
uracil  oxime  binding  element  to  the  trihydroxybenzaldoxime 
part.  The  6-formyluracil  O-methyl  oxime  51  shows  a  cleanly 
competitive  mode  of  inhibition  with  Kcsl  =  45  +  2  «M  (Table 
4,  data  not  shown).  Thus,  the  enhancement  in  the  free  energy 
of  binding  upon  addition  of  the  trihydroxybenzaldoxime  (THB) 
part  to  the  6-formyluracil  oxime  element  is  AAGTHB  =  —RT 
\n{K?~^21  IK^)  =  —3  kcal/mol.  We  were  unable  to  perform 
a  similar  energetic  analysis  with  the  trihydroxybenzaldoxime 
O-methyl  ether  (48)  due  to  its  extremely  weak  binding  (9% 


(48)  Page,  M.  I.;  Jencks,  W.  P.  Proc.  Natl.  Acad.  Sci.  U.S.A.  1971.  68,  1678- 
1683. 

(49)  Jencks,  W.  P.  Binding  Energy,  Specificity,  and  Enzymic  Catalysis:  The 
Circe  Effect.  In  Catalysis  in  Chemistry  and  Enzmology,  Dover  Publica¬ 
tions:  New  York,  1987;  pp  615—807. 


inhibition  at  1  mM  concentration,  data  not  shown).  Similarly, 
an  energetic  analysis  of  the  binding  elements  comprising  2-(2)- 
13  was  not  possible  because  of  the  extremely  weak  inhibition 
by  the  5-formyluracil  O-methyl  oxime  (50)  and  the  dihydroxy- 
benzaldoxime  O-methyl  ether  (47).  Nevertheless,  the  140-fold 
greater  binding  affinity  of  3-(3)-27  as  compared  to  the  6-formy¬ 
luracil  O-methyl  oxime  binding  element  (51)  alone  indicates 
that  a  large  benefit  can  be  derived  from  tethering.50 

Experimental  Section 

Reagents  and  General  Methods.  All  chemicals  were  purchased 
from  commercial  sources  without  further  purification  unless  otherwise 
stated.  The  'H,  13C,  and  19F  NMR  spectra  were  recorded  on  a  400  MHz 
Varian  Innova  instrument.  The  spectra  were  recorded  in  deuteriochlo- 
roform  (CDCT,)  or  in  hexadeuteriodimethyl  sulfoxide  (DMSO-*).  The 
chemical  shifts  of  protons  are  given  in  ppm  with  TMS  as  internal 
standard.  The  chemical  shifts  of  carbons  are  obtained  in  ppm  with 
solvents  as  internal  standards.  That  of  fluorine  is  given  in  ppm  with 
1%  trifluoroacetic  acid  in  DMSO-*  as  an  external  standard.  Most  of 
oximes  were  purified  by  HPLC  using  aqueous  triethylammonium 
acetate  (TEAA)  as  a  running  buffer.  Therefore,  TEAA  was  not 
completely  removed  and  it  appeared  in  the  NMR  spectra.  Accordingly, 
proton  and  carbon  chemical  shifts  of  TEAA  were  not  listed  during  the 
characterizations  of  the  oximes.  During  the  purification  of  the  oxime 
3-(5)-27,  2-mercaptoethanol  was  used  as  an  antioxidant.  Therefore, 
small  amounts  of  this  compound  and  its  oxidation  product  are  also 
present  in  the  oxime  3-(i)-27.  Flash  chromatographies  were  performed 
with  silica  (70—230  mesh  from  Sorbent  Technologies)  and  monitored 
by  thin-layer  chromatography  (TLC)  with  silica  plates  (Merck,  Kieselgel 
60  F254). 

Synthesis  of  Alkyl  Hydroxyamines.  0,0'-Diaminoalkanediol  link¬ 
ers  of  variable  length  (ethyl,  propyl,  butyl,  pentyl,  hexyl)  were  prepared 
from  the  corresponding  dibromoalkanes  in  two  steps  according  to 
literature  procedures  (Figure  2A).33-51-52 

General  Synthesis  of  Tethered  Oxime  Dimers.  A  set  of  14  aryl 
aldehydes  (4—17;  cf.  Supporting  Information  Table  SI)  was  selected 
for  library  synthesis  for  coupling  to  the  three  uracil  containing  aldehydes 
(1—3,  Figure  2)  using  the  0,O'-diaminoalkanediol  linkers  as  follows. 
To  each  0.5-mL  well  of  a  Matrix  microtiter  plate  was  added  a  DMSO 
stock  solution  of  AcOH  (20  /(L,  150  mM,  3  /(mol),  uracil  aldehyde 

1—3  (20  [iL ,  150  mM,  3  /(mol),  and  a  single  aryl  aldehyde  (20  /(L, 
150  mM,  3  /(mol).  The  plate  was  carefully  agitated  to  make  the  solutions 
homogeneous.  To  each  of  the  uracil— aryl  aldehyde  mixture  was  added 
a  DMSO  solution  of  the  0,O'-diaminoalkanediol  linkers  containing  each 
of  the  five  linker  lengths  in  equal  proportion  (22  /(L,  150  mM,  3.3 
/tmol  total  amine  equivalents).  The  plate  was  sealed,  further  agitated, 
and  incubated  in  an  oven  for  12  h  at  37  °C. 

The  most  potent  inhibitors  from  this  first  screen  2-(2)-13  and  3-(3)- 
13)  were  synthesized  in  larger  scale  and  thoroughly  characterized  after 
HPLC  purification  of  the  heterodimers  as  follows. 

2- (2)-13:  ‘H  NMR  (400  MHz,  DMSO-*)  6  8.05  (s,  1  H),  7.91  (s, 
1  H),  7.78  (s,  1  H),  7.04  (s,  J  =  2.4  Hz,  1  H),  6.86  (m,  1  H),  6.74  (d, 
J  =  8.0  Hz,  1  H),  4.26  (s,  1  H);  13C  NMR  (100  MHz,  DMSO-*)  <5 
162.40,  151.04,  149.25,  147.92,  145.75,  142.74,  140.66,  123.05,  119.88, 
115.74,  113.10,  104.31,71.82,71.54;  UV/vis  2max  275  nm;  HRMS  {ml 
z)  [M  +  Na]+  calcd  for  CuH^OgNa  357.08,  found  357.08. 

3- (3)-13:  ‘H  NMR  (400  MHz,  DMSO-*)  (5  9.10  (bs,  H),  8.01  (s,  1 
H),  7.94  (s,  1  H),  7.04  (d,  J  =  1.6  Hz,  1  H),  6.82  (d,  J  =  7.6  Hz,  1  H), 
6.74  (d,  J  =  7.6  Hz,  1  H),  5.78  (s,  1  H),  4.26  (t,  J  =  6.8  Hz,  2  H),  4. 12 
(t,  J  =  6.0  Hz,  2  H),  2.06  (t,  J  =  6.8  Hz,  2  H);  13C  NMR  (125  MHz, 


(50)  The  uracil  N1 -acetaldehyde  O-methyl  oxime  (49)  showed  undetectable 
inhibition  (Kc  >  10  mM). 

(51)  Rung,  P.  P.;  Bharadwaj,  R.;  Fraser,  A.  S.;  Cook,  D.  R.;  Kawasaki,  A.  M.; 
Cook,  P.  D.  J.  Org.  Chem.  1998.  63,  1846-1852. 

(52)  Weiss,  R.  H.;  Furfine,  E.;  Hausleden,  E.;  Dixon,  D.  W.  J.  Org.  Chem. 
1984,  49,  4969-4972. 
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DMSO-*)  (3  163.95,  151.15,  148.96,  148.04,  145.89,  144.73,  142.23, 
123.12,  119.83,  115.81,  113.15,  101.60,  71.94,  69.76,  28.46;  UV/vis 
A max  273  nm;  HRMS  ( m/z )  [M  +  H]+  calcd  for  Ci5Hi7N406  349.11, 
found  349.11. 

The  second  set  of  oxime  dimers  based  on  the  3-(3)-13  hit  discovered 
in  the  first  screening  round  were  synthesized  in  an  identical  fashion  as 
described  above  using  uracil  aldehyde  3  and  hydroxybenzaldehydes 
18—42  and  the  0,0'-diaminopropanediol  linker  (cf.  Supporting  Infor¬ 
mation  Table  S2).  The  most  potent  inhibitor  identified  from  this  second 
round  of  screening  (3-(3)-27)  was  synthesized  in  larger  scale  and 
thoroughly  characterized.  3-(3)-27:  'H  NMR  (400  MHz,  DMSO-*)  d 
9.10  (bs,  H),  8.21  (s,  1  H),  7.94  (s,  1  H),  6.88  (s,  1  H),  6.31  (s,  1  H), 
5.78  (s,  1  H),  4.28  (t,  J  =  6.0  Hz,  2  H),  4.10  (t,  J  =  6.0  Hz,  2  H),  2.06 
(m,  2  H);  13C  NMR  (125  MHz,  DMSO-*)  (3  163.91,  151.04,  150.25, 
149.17,  146.90,  144.63,  142.20,  138.73,  112.76,  107.78,  103.56,  101.69, 
71.90,  69.77,  28.38;  UV/vis  2max  286  nm;  ESI  (m/z)  for  [M  +  H]+ 
calcd  for  Ci5Hi8N407  366,  found  366;  ESI  (m/z)  for  [M  +  Na]+  calcd 
for  Ci5Hi7N407Na  388,  found  388;  ESI  (m/z)  for  [M  —  H]~  calcd  for 
Ci5Hi6N407  364,  found  364. 

Isolation  and  Purification  of  Oxime  Dimers  using  HPLC.  All  of 

the  most  active  oxime  heterodimers  were  purified  by  HPLC  using  a 
Phenomenex  Aqua  reversed  phase  C-18  HPLC  column  (250  mm,  10 
mm,  5  fim).  Most  of  the  oximes  were  purified  using  gradient  elution 
from  0  to  30%  CH3CN  in  0. 1  M  aqueous  TEAA  over  the  course  of  2 
h  using  UV  detection  at  254  nm.  An  exception  was  oxime  3-(5)-27, 
which  is  prone  to  air  oxidation.  In  this  case,  25  mM  2-mercaptoethanol 
was  added  to  both  of  the  running  buffers.  The  oximes  all  eluted  with 
baseline  resolution  in  the  order  U— U  homodimer,  U— R  heterodimer, 
followed  by  the  R— R  homodimer.  This  HPLC  method  was  also  used 
to  confirm  the  expected  1:2:1  stoichiometries  of  homodimer  and 
heterodimer  oxime  formation,  using  10  representative  uracil  and  aryl 
aldehydes  from  the  library  (see  Supporting  Information  Figure  SI). 
Additional  NMR  evidence  supporting  the  expected  stoichiometries  is 
detailed  in  the  Supporting  Information  Figures  S2  and  S3. 

Synthesis  of  2-R-Substituted  3,4-Dihydroxybenzaldehydes  and 
the  Corresponding  Mixed  Oximes  with  3.  Aldehyde  43  was 
synthesized  by  removing  the  methyl  groups  of  the  commercially 
available  3,4-dimethoxy-6-fluorobenzaldehyde  using  BBr3  in  CH2CI2.53 
The  aldehydes  44  and  45  were  synthesized  by  removing  the  methylene 
group  of  the  corresponding  2-halogenated  piperonal  using  A1C13  and 
6N  HC1.54  Aldehyde  46  was  commercially  available.  These  four 
aldehydes  (43—46)  were  reacted  with  6-formyluracil  3  and  the  0,0'- 
diaminopropanediol  linker  using  the  procedure  described  above,  and 
the  mixed  oxime  dimer  was  obtained  after  HPLC  purification. 

3-(3)-43:  ‘H  NMR  (400  MHz,  DMSO-*)  <5  8.11  (s,  1  H),  7.94  (s, 
1  H),  7.05  (d,  J  =  7.2  Hz,  1  H),  6.58  (d,  J  =  6.8  Hz,  1  H),  5.77  (s,  1 
H),  5.10  (bs,  H),  4.28  (t,  J  =  6.8  Hz,  2  H),  4.16  (t,  J  =  6.0  Hz,  2  H), 
2.07  (m,  2  H);  13C  NMR  (125  MHz,  DMSO-*)  <5  163.91,  155.50, 
153.09,  151.08,  149.87,  149.76,  144.65,  142.72,  142.30,  142.21,  110.91, 
110.87,  108.62,  108.49,  103.21,  102.96,  101.67,  71.85,  70.06,  28.38; 
19F  NMR  (DMSO-*)  <5  -54.33,  -54.35,  -54.36,  -54.38;  UV/vis  Amax 
268  nm;  HRMS  (m/z)  [M  +  Na]+  calcd  for  Ci5H15FN406Na  389.09, 
found  389.09. 

3-(3)-44:  ‘H  NMR  (400  MHz,  DMSO-*)  (5  8.23  (s,  1  H),  7.94  (s, 
1  H),  7.18  (s,  1  H),  6.77  (s,  1  H),  5.78  (s,  1  H),  4.27  (t,  J  =  5.6  Hz,  2 
H),  4.16  (t,  J  =  6.0  Hz,  2  H),  2.08  (m.  2  H);  13C  NMR  (125  MHz, 
DMSO-*)  d  163.90,  151.08,  149.71,  145.48,  145.16,  144.64,  142.21, 
123.00,  119.09,  116.08,  112.33,  101.67,  71.85,  70.22,  28.37;  UV/vis 
A max  275  nm;  HRMS  (m/z)  [M  +  Na]+  calcd  for  Ci5H15ClN406Na 
405.06,  found  405.06. 

3-(3)-45:  ‘H  NMR  (400  MHz,  DMSO-*)  (5  8.18  (s,  1  H),  7.94  (s, 
1  H),  7.19  (s,  1  H),  6.93  (s,  1  H),  5.78  (s,  1  H),  4.27  (t,  J  =  6.4  Hz,  2 


(53)  Kirk,  K.  L.;  Cantacuzene.  D.;  Nimitkitpaisan,  Y.;  Mcculloh.  D.;  Padgett, 
W.  L.;  Daly.  J.  W.;  Creveling,  C.  R.  J.  Med.  Chem.  1979,  22,  1493-1497. 

(54)  Reitz,  A.;  Avery,  M.  A.;  Verlander,  M.  S.;  Goodman,  M.  J.  On;.  Chem. 
1981,  46,  4859-4863. 
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H),  4.16  (t,  J  =  6.4  Hz,  2  H),  2.08  (m,  2  H);  13C  NMR  (125  MHz, 
DMSO-*)  <5  163.91,  151.10,  150.07,  147.41,  146.02,  144.66,  142.22, 
120.58,  119.15,  112.89,  112.29,  101.69,  71.86,  70.24,  28.38;  UV/vis 
2, maX  278  nm;  HRMS  (m/z)  [M  +  Na]+  calcd  for  Ci5Hi5BrN4C>6Na 
449.01,  found  449.01. 

3-(3)-46:  ‘H  NMR  (400  MHz,  DMSO-*)  <3  8.56  (d,  J  =  1.2  Hz,  1 
H),  7.95  (s,  1  H),  7.37  (s,  1  H),  6.74  (s,  1  H),  6.26  (bs,  H),  5.78  (d,  J 
=  1.2  Hz,  1  H),  4.28  (t,  J  =  6.0  Hz,  2  H),  4.16  (t,  /  =  6.0  Hz,  2  H), 
2.09  (m,  2  H);  13C  NMR  (125  MHz,  DMSO-*)  <5  163.88,  160.27, 
151.05,  148.62,  148.00,  144.62,  142.22,  133.36,  122.38,  113.80,  109.23, 
101.73,  71.85,  70.07,  28.39;  UV/vis  2max  269  nm;  HRMS  (m/z)  [M  + 
H]+  calcd  for  Ci5H16N508  394.10,  found  394.10. 

Synthesis  of  Methyl  Oxime  Derivatives  of  1—3,  13,  and  27.  The 
O-methyl  oxime  of  3,4-dihydroxybenzaldehyde  (47)  is  known  and  was 
synthesized  using  13  and  O-methylhydroxylamine  hydrochloride.55 
O-Methyloximes  48—51  were  made  using  a  similar  method. 

48:  To  a  solution  of  27  (308  mg,  2.0  mmol)  in  4.0  mL  of  EtOH— 
H20— THF  (0.45/0.3/0.25)  were  added  sodium  acetate  (264  mg)  and 
O-methylhydroxylamine  hydrochloride  (183  mg),  and  the  solution  was 
stirred  at  room  temperature  for  overnight.  The  solvents  were  removed 
in  vacuo,  and  the  residue  was  extracted  with  chloroform  three  times. 
The  combined  organic  layers  were  dried  over  anhydrous  MgS04  and 
concentrated  in  vacuo.  The  residue  was  purified  by  silica  gel  column 
chromatography  (EtOAc/hexanes)  to  give  a  product  amount  of  347  mg 
in  95%  yield:  ‘H  NMR  (400  MHz,  DMSO-*)  (5  9.38  (s,  1  H),  9.21 
(s,  1  H),  8.52  (s,  1  H),  8.18  (s,  1  H),  6.88  (s,  1  H),  6.30  (s,  1  H),  3.80 
(s,  3  H);  13C  NMR  (125  MHz,  DMSO-*)  <5  150.17,  148.88,  146.52, 
138.61,  1 12.51,  107.77,  103.46,  61.28;  UV/vis  /W  239,  274  nm;  HRMS 
(m/z)  [M  +  H]+  calcd  for  CsH10NO4  184.06,  found  184.06. 

49:  To  a  solution  of  1  (10.8  mg,  0.063  mmol)  in  hot  DMF  (0.5 
mL)  were  added  sodium  acetate  (5.2  mg,  0.063  mmol)  solution  in  water 
(0.1  mL)  and  O-methylhydroxylamine  hydrochloride  (5.3  mg,  0.063 
mmol),  and  the  solution  was  stirred  at  room  temperature  for  overnight. 
The  solvents  were  removed  in  vacuo,  and  the  residue  was  purified  by 
column  chromatography  using  10—15%  (v/v)  methanol  in  CH2CI2, 
resulting  in  90%  yield  (10.3  mg  50/50  mixture  of  trans  and  cis  geometric 
isomers):  'H  NMR  (400  MHz,  chloroform-*  (5  9.56  (s,  1  H),  7.43  (t, 
J  =  5.2  Hz, 0.5  H),  7.20  (m,  1  H),  6.80  (t,  J  =  4.4  Hz,0.5  H),  5.78  (m, 
1  H),  4.55  (d,  J  =  4.4  Hz,l  H),  4.48  (d,  J  =  5.6  Hz,l  H),  3.94  (s,  1.5 
H),  3.87  (s,  1.5  H);  13C  NMR  (100  MHz,  DMSO-*)  <5  163.92,  163.87, 
151.06,  150.99,  144.72,  144.54,  143.93,  143.53,  103.14,  102.97,  62.68, 
62.37,  46.52,  43.76;  UV/vis  Amax  263  nm;  HRMS  (m/z)  [M  +  H]+  calcd 
for  C7H10N3O3  184.07,  found  184.07. 

50:  To  a  solution  of  2  (70  mg,  0.5  mmol)  in  hot  DMF  (1  mL)  were 
added  sodium  acetate  (41  mg,  0.5  mmol)  solution  in  water  (0.5  mL) 
and  O-methylhydroxylamine  hydrochloride  (42  mg,  0.5  mmol),  and 
the  solution  was  stirred  at  room  temperature  for  4  h.  The  solvents  were 
removed  in  vacuo,  and  the  residue  was  collected  by  filtration  and 
washed  with  cold  water  2  x  1  mL,  resulting  in  76%  yield  (70  mg 
87/13  mixture  of  trans  and  cis  geometric  isomers):  'H  NMR  (400  MHz, 
DMSO-*)  <3  11.40  (bs,  2  H),  8.52  (s,  0.13  H),  7.87  (s,  0.87  H),  7.74 
(s,  0.87  H),  7.29  (s,  0.13  H),  3.89  (s,  0.39  H),  3.80  (s,  2.61  H);  13C 
NMR  (100  MHz,  DMSO -*)  <5  162.98,  162.36,  150.80,  150.27,  146.08, 
142.26,  140.09,  137.31,  104.41,  103.43,  62.32,  61.44;  UV/vis /lmax  288 
nm;  HRMS  (m/z)  [M  +  H]+  calcd  for  C6HgN303  170.06,  found  170.06. 

51:  To  a  solution  of  3  (79  mg,  0.5  mmol)  in  hot  DMF  (2.0  mL) 
were  added  sodium  acetate  (46  mg,  0.5  mmol)  solution  in  water  (0.5 
mL)  and  O-methylhydroxylamine  hydrochloride  (46  mg,  0.5  mmol), 
and  the  solution  was  stirred  at  50  °C  for  4  h.  The  solvents  were  removed 
in  vacuo,  and  the  residue  was  washed  by  cold  water.  After  the  filtration, 
product  was  obtained  in  62%  yield  (53  mg):  'H  NMR  (400  MHz, 
DMSO-*)  <5  11.18  (s,  1  H),  10.77  (s,  1  H),  7.91  (s,  1  H),  5.77  (s,  1 
H),  3.96  (s,  3  H);  13C  NMR  (125  MHz,  DMSO-*)  <5  163.87,  151.02, 


(55)  Watanabe,  T.;  Suzuki,  T.;  Umezawa,  Y.;  Takeuchi,  T.;  Otsuka,  M.; 
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144.51,  142.17,  101.41,  62.84;  UV/vis  Amax  292  nm;  HRMS  (m/z)  [M 
+  Na]+  calcd  for  C6H7N303Na  192.04,  found  192.04. 

High-Throughput  Inhibitor  Screening.  The  substrate  in  this  HTS 
assay  was  synthesized  using  standard  phosphoramidite  DNA  solid-phase 
chemistry  using  reagents  purchased  from  Glen  Research.  The  DNA 
was  purified  using  anion  exchange  chromatography  followed  by 
desalting  using  reversed  phase  methods.  The  sequence  and  size  was 
confirmed  using  analytical  denaturing  polyacrylamide  gel  electrophore¬ 
sis  and  MALD1-MS.  The  substrate  is  a  double-stranded  14-mer  DNA 
containing  nine  U-A  base  pairs  (5'-FAM-GCA  CUU  AAG  AAU  UG: 
3'-DABSYL-CA  AUU  CUU  AAG  UGC).  The  UNG  HTS  assay  is 
performed  as  follows.  To  a  96-well  microtiter  plate  was  added  5  /.iL  (2 
mM  total)  of  compound  in  DMSO,  followed  by  75  /.iL  (33.3  pM)  of 
human  UNG  in  reaction  buffer  (10  mM  Tris-HCl,  pH  8.0,  20  mM  NaCl, 
7.5  mM  MgCE,  0.002%  brij-35).  The  reactions  were  initiated  by  the 
addition  of  20  /xL  (250  nM)  of  molecular  beacon  substrate  in  reaction 
buffer.  The  plates  are  incubated  at  ambient  temperature  in  a  fluorescence 
plate  reader  for  30  min,  and  the  progress  of  the  reaction  was  monitored 
every  5  min  (ex  485  nm/em  520  nm).  The  final  concentrations  of  the 
reagents  in  the  assay  are  10  mM  Tris-HCl,  pH  8.0,  20  mM  NaCl,  7.5 
mM  MgCU,  0.002%  Brij-35,  25  pM  human  UNG,  50  nM  molecular 
beacon  substrate,  100  «M  total  compound,  and  5%  DMSO.  The  MgCh 
is  essential  to  increase  the  stability  of  the  double-stranded  DNA 
substrate  and,  thus,  decrease  the  initial  fluorescence  of  the  molecular 
beacon  and  increase  the  maximum  signal  of  the  assay.  Addition  of 
Brij-35,  a  nonionic  detergent,  is  essential  to  stabilize  human  UNG  at 
the  low  concentration  used  in  this  assay.  A  similar  assay  has  been 
described  by  Maksimenko  et  al.  that  utilizes  a  39-mer  hairpin  DNA.56 
However,  the  synthesis  and  purification  of  this  more  complex  substrate 
proceeds  with  low  efficiency  and  requires  higher  temperature  to  induce 
strand  separation  (Krosky  and  Stivers,  unpublished  data).  In  contrast, 
the  14-mer  double-stranded  molecular  beacon  is  routine  and  allows 
screening  to  be  performed  conveniently  at  room  temperature. 

Mechanism  of  Inhibition.  The  substrate  used  in  mechanism  of 
inhibition  studies  was  a  modified  DNA  hairpin  where  the  two  strands 
described  above  are  connected  by  a  hexakis[poly(ethylene  glycol)} 
linker  (PEG-U9).  This  substrate  was  easier  to  synthesize  and  purify 
than  an  all-DNA  haitpin  and,  unlike  the  double  stranded  DNA  substrate, 
does  not  require  MgCb  to  achieve  minimum  fluorescence.  To  a  96- 
well  plate  was  added  5  /.iL  of  compound  in  DMSO,  followed  by  75  ph 
of  PEG-U9  hairpin  in  reaction  buffer  (20  mM  Tris-HCl,  pH  8.0,  50 
mM  KC1,  0.2  mM  MgCl2,  0.002%  Brij-35,  1  mM  DTT).  Eight  different 
DNA  concentrations  were  used  in  the  range  62.5—2000  nM.  Reactions 
were  initiated  by  the  addition  of  20  fiL  of  0.5  nM  human  UNG  in 
reaction  buffer.  The  final  concentrations  of  reagents  in  the  assay  are 
20  mM  Tris-HCl,  pH  8.0,  50  mM  KC1,  0.2  mM  MgCl2,  0.002%  Brij- 
35,  1  mM  DTT,  5%  DMSO,  0.1  nM  human  UNG,  62.5-2000  nM 


(56)  Maksimenko,  A.:  Ishchenko,  A.  A.;  Sanz,  G.;  Laval,  J.;  Elder,  R.  H.; 
Saparbaev,  M.  K.,  Biochem.  Biophys.  Res.  Commun.  2004,  319,  240—246. 


PEG-U9  hairpin  DNA,  and  variable  amounts  of  inhibitor.  The  plates 
were  incubated  at  ambient  temperature  in  a  fluorescence  plate  reader 
for  60  min,  and  the  progress  of  each  reaction  was  monitored  every  5 
min  (ex  485  nm/em  520  nm).  Afterward,  Escherichia  coli  UNG  was 
added  to  each  well  to  drive  the  reactions  to  completion,  and  the  overall 
change  in  fluorescence  values  were  measured.  These  values  were  used 
to  convert  initial  velocities  from  units  of  fluorescence  units/s  to 
[product]/s.  Mechanisms  of  inhibition  and  their  corresponding  inhibitor 
dissociation  constants  were  determined  by  Lineweaver— Burk  slope  and 
intercept  replot  analysis  and  by  computational  simulations  of  the  initial 
velocity  against  inhibitor  concentration  data  using  Dynafit  v.3.28  (see 
Supporting  Information) 

Conclusions 

We  have  developed  an  efficient  strategy  to  develop  small- 
molecule  inhibitors  of  UNG  that  have  the  potential  for  activity 
in  cell  culture  or  in  vivo.  The  method  is  quite  general  and  could 
be  adapted  to  target  other  enzymes  that  bind  extrahelical  bases 
or  free  nucleosides.  Two  future  targets  of  the  current  uracil 
mixed  oxime  library  would  be  the  essential  bacterial  enzyme 
deoxyuridine  nucleotidylhydrolase,  which  converts  dUTP  to 
dUMP, 23,57-60  and  human  thymidine  phosphorylase,  an  enzyme 
implicated  in  vascularization  of  tumors.61  Such  inhibitors  could 
serve  as  useful  tools  to  study  the  life  cycle  of  pathogenic  human 
viruses,  the  biology  of  uracil  base  excision  repair  in  normal 
cell  lines  and  tissues,  and  mechanisms  of  tumor  vascularization. 

Acknowledgment.  This  work  was  supported  by  NIH  Grant 
GM56834-10  to  J.T.S.  D.J.K  was  supported  by  the  DOD  Breast 
Cancer  Research  Program  (Grant  DAMD17-03-1-1251). 

Supporting  Information  Available:  Tables  of  aryl  aldehydes 
and  hydroxybenzaldehydes  used  to  construct  oxime  libraries, 
HPLC  traces  and  NMR  spectra  to  ascertain  stoichiometries  of 
oxime  mixtures,  'H  and  13C  NMR  spectra  of  3-(3)-13  and  3 -(3)- 
27,  computer  simulations  of  inhibition  data,  and  analytical 
equations  for  each  inhibition  mechanism.  This  material  is 
available  free  of  charge  via  the  Internet  at  http://pubs.acs.org. 

JA055846N 


(57)  Grasser,  F.  A.;  Romeike.  B.  F.;  Niedobitek,  G.;  Nicholls,  J.;  Kremmer,  E. 
Curr.  Protein  Pept.  Sci.  2001,  2,  349—360. 

(58)  Hidalgo-Zarco,  F.;  Gonzalez-Pazanowska,  D.  Curr.  Protein  Pept.  Sci.  2001, 
2,  389-397. 

(59)  Studebaker,  A.  W.:  Balendiran,  G.  K.;  Williams,  M.  V.  Curr.  Protein  Pept. 
Sci.  2001,  2,  371-379. 

(60)  Williams,  M.  V.  Virology  1988,  166 ,  262-264. 

(61)  Toi,  M.;  Atiqur  Rahman,  M.;  Bando,  H.:  Chow,  L.  W.  Lancet  Oncol.  2005, 
6,  158-166. 


PAGE  EST:  8.7 


J.  AM.  CHEM.  SOC.  I 


Bioorganic  & 

Medicinal 

Chemistry 


ELSEVIER 


Available  online  at  www.sciencedirect.com 


SCIENCE 


DIRECT  ® 


Bioorganic  &  Medicinal  Chemistry  14  (2006)  5666-5672 


Synthesis  and  high-throughput  evaluation  of  triskelion  uracil 
libraries  for  inhibition  of  human  dUTPase  and  UNG2 

Yu  Lin  Jiang,  Suhman  Chung,  Daniel  J.  Krosky  and  James  T.  Stivers* * 

Department  of  Pharmacology  and  Molecular  Sciences,  Johns  Hopkins  University  School  of  Medicine, 

725  North  Wolfe  Street,  Baltimore,  MD  21205,  USA 

Received  22  February  2006;  revised  10  April  2006;  accepted  13  April  2006 
Available  online  5  May  2006 


Abstract — Human  nuclear  uracil  DNA  glycosylase  (UNG2)  and  deoxyuridine  triphosphate  nucleotidohydrolase  (dUTPase)  are  the 
primary  enzymes  that  prevent  the  incorporation  and  accumulation  of  deoxyuridine  in  genomic  DNA.  These  enzymes  are  desirable 
targets  for  small  molecule  inhibitors  given  their  roles  in  a  wide  range  of  biological  processes  ranging  from  chromosomal  rearrange¬ 
ments  that  lead  to  cancer,  viral  DNA  replication,  and  the  formation  of  toxic  DNA  strand  breaks  during  anticancer  drug  therapy.  To 
accelerate  the  discovery  of  such  inhibitors,  we  have  developed  a  high-throughput  approach  for  directed  library  synthesis  and  screen¬ 
ing.  In  this  efficient  technology,  a  uracil-aldehyde  ligand  is  covalently  tethered  to  one  position  of  a  trivalent  alkyloxyamine  linker  via 
an  oxime  linkage,  and  then  the  vacant  linker  positions  are  derivatized  with  a  library  of  aldehydes.  The  resulting  triskelion  oximes 
were  directly  screened  for  inhibitory  activity  and  the  most  potent  of  these  showed  micromolar  binding  affinities  to  UNG2  and 
dUTPase. 

©  2006  Elsevier  Ltd.  All  rights  reserved. 


1.  Introduction 

From  the  classic  view  of  DNA  repair  and  mutagenesis, 
the  uracil  base  has  no  place  in  genomic  DNA.1 
Accordingly,  elaborate  DNA  repair  mechanisms  have 
evolved  to  exclude  dUTP  from  the  nucleotide  pool 
used  for  DNA  replication,5,6  and  to  remove  uracil  from 
DNA  when  it  arises  from  spontaneous  deamination  of 
cytosine  bases.7  However,  the  uracil  base  has  recently 
been  found  to  play  a  much  more  diverse  role  in  human 
biology,  disease,  and  anticancer  therapy  (Fig.  1).  Sur¬ 
prisingly,  the  uracil  excision  repair  machinery  has  been 
found  to  participate  in  the  process  of  generating 
somatic  mutations  during  antibody  maturation  in  B 
cells, 8-10  and  uracil  incorporation  and/or  removal  is 
critical  in  the  life  cycles  of  herpes,11  cytomegalo,12 
pox,13,14  and  type  1  human  immunodeficiency  viruses 
(HIV-1).15  Furthermore,  this  pathway  also  generates 
the  pharmacologically  active  single  and  double  strand 
DNA  breaks  that  are  the  essential  tumor  killing  lesions 
produced  by  the  widely  used  anticancer  drugs  5-fluoro- 
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Figure  1.  Biological  effects  of  uracil  in  DNA.2  4 


uracil  and  methotrexate,1617  and  generates  the  charac¬ 
teristic  chromosomal  translocations  found  in  some  B 
cell  lymphomas.  Thus,  pharmacologic  agents  that 
inhibit  these  processes  are  desirable  for  both  investiga¬ 
tional  and  therapeutic  purposes. 

Human  nuclear  uracil  DNA  glycosylase  (UNG2) 
and  deoxyuridine  triphosphate  nucleotidohydrolase 
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(dUTPase)  are  the  primary  enzymes  that  prevent  the 
incorporation  and  accumulation  of  deoxyuridine  in  geno¬ 
mic  DNA.17'18  Given  that  these  enzymes  are  emerging  as 
interesting  pharmacologic  targets,  we  have  sought  out 
methods  for  the  rapid  and  efficient  identification  of  small 
molecule  ligands  that  can  inhibit  their  activity.  One  of  the 
most  exciting  potential  applications  of  UNG2  and  dUT¬ 
Pase  inhibitors  would  be  as  antiretroviral  agents.  Recent 
findings  have  established  that  HIV-1  specifically  packag¬ 
es  UNG2  into  virus  particles  via  interaction  with  the 
virus  encoded  integrase  protein  (Int),  or  perhaps  a  ter¬ 
nary  complex  between  UNG,  Int  and  the  viral  Vpr  pro¬ 
tein. 1 119  26  UNG2  is  required  for  infection  of 
nondividing  cells  such  as  macrophages  and  resting  T 
cells,  and  virus  particles  produced  from  UNG  depleted 
cells  are  incapable  of  infecting  new  target  cells.15,27 
Infection  of  macrophages  helps  maintain  a  viral  reser¬ 
voir  in  the  host  that  is  crucial  for  virus  spread  to  the 
lymphoid  organs  and  T-helper  lymphocytes,  and  ulti¬ 
mately,  AIDS  pathogenesis.21,28  UNG  is  apparently 
recruited  to  minimize  uracil  incorporation  into  the  viral 
genome  in  these  cells,  which  have  naturally  high  levels  of 
dUTP,  a  good  substrate  for  the  viral  reverse  transcrip¬ 
tase.29  Inhibition  of  dUTPase  would  be  expected  to  fur¬ 
ther  increase  dUTP  levels  in  macrophages,  resulting  in 
even  more  uracil  misincorporation  into  the  viral 
genome  (Fig.  1).  Pharmacologic  targeting  of  a  UNG2 
and  dUTPase  is  extremely  attractive  because  these  tar¬ 
gets  would  not  be  susceptible  to  the  same  high  mutagen¬ 
esis  rate  and  resulting  drug  resistance  as  viral  encoded 
proteins.30  Targeting  UNG2  is  a  viable  therapeutic 
strategy  because  it  is  not  an  essential  enzyme.  Thus, 
UNG  knock-out  mice  display  no  remarkable  pheno¬ 
type,  nor  do  UNG  null  yeast  or  human  cell  lines.31 
Although  dUTPase  is  an  essential  enzyme  in  all  organ¬ 
isms,  it  would  be  expected  that  rapidly  replicating  virus¬ 
es  such  as  HIV-1  would  show  higher  sensitivity  than  the 
host,  providing  a  potential  therapeutic  window. 


Herein,  we  report  an  integrated  high-throughput  (HTP) 
platform  for  the  synthesis  and  evaluation  of  uracil- 
directed  small  molecule  libraries  based  upon  triskelion 
oxyamine  scaffolds.  The  strategy  is  to  attach  a  uracil- 
aldehyde  ligand  to  one  or  two  arms  of  the  triskelion 
scaffold  and  then  derivatize  the  vacant  position(s)  with 
a  random  library  of  aldehydes  (RCHO).  The 
uracil  moiety  is  expected  to  weakly  target  the  fully 
functionalized  compound  to  the  active  site  rather  than 
irrelevant  regions  of  the  enzyme,  and  the  random 
functional  groups  can  then  explore  nearby  binding 
pockets  resulting  in  increased  affinity  over  that  of  the 
uracil  alone.  Library  compounds  are  rapidly  screened 
using  robust  HTP  activity  assays,  from  which 
several  inhibitors  of  UNG2  and  dUTPase  have  been 
identified. 


2.  Results  and  discussion 

2.1.  Synthesis  of  uracil  triskelion  oxime  libraries 

We  sought  an  inhibitor  development  strategy  that  al¬ 
lowed  rapid  and  economical  synthesis  of  small  molecule 
ligands  that  explore  binding  sites  near  the  UNG2  and 
dUTPase  active  sites,  and  which  could  be  used  directly 
in  HTP  screening  applications  without  purification. 
One  efficient  synthesis  strategy  that  meets  these  criteria 
is  outlined  in  Schemes  1  and  2.  First,  a  triskelion  oxy¬ 
amine  scaffold  is  synthesized  in  two  steps  from 
tris(hydroxymethyl)methane  (Scheme  1).  Then  the  three 
oxyamine  groups  are  derivatized  with  a  uracil-aldehyde 
and  a  library  of  215  aldehyde  binding  elements  (RCHO, 
Table  1)  via  the  formation  of  stable  oxime  linkages 
(Scheme  2). 

Each  linking  reaction  is  carried  out  in  one  well  of  a  96- 
well  microtiter  plate  that  contains  one  molar  equivalent 


O 


Scheme  1.  Reagents  and  conditions:  (a)  triphenylphosphine,  diisopropyl  azodicarboxylate,  JV-hydroxyphthalimide,  anhydrous  THF,  overnight,  0  °C, 
53%  yield:  (b)  anhydrous  NH2NH2,  95%  ethanol,  room  temperature,  2  h,  67%.  See  Ref.  32. 
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Scheme  2.  Reagents  and  conditions:  AcOH  (20%  v/v),  DMSO,  12  h,  37  °C. 
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Table  1.  Representative  aryl  aldehydes  (RCHO)  used  in  library  synthesis3 


“The  entire  library  consisted  of  215  aryl  aldehydes. 


uracil  aldehyde,  two  molar  equivalents  RCHO  library 
member,  and  one  molar  equivalent  oxyamine  triskelion 
scaffold  (Scheme  2).  The  reactions  typically  proceed  to 
85-99%  completion  after  overnight  incubation  (DMSO, 
37  °C),  and  produce  a  statistical  mixture  of  the  homotri- 
meric  (UUU,  RRR)  and  heterotrimeric  (UUR,  RRU) 
oximes  in  the  approximate  amounts  indicated  in  Scheme 
2  (see  supplemental  materials).  No  compound  purifica¬ 
tion  is  required  before  screening  the  library  for  active 
compounds. 

2.2.  High-throughput  screening  of  triskelion  libraries 
against  UDG 

We  have  previously  developed  a  high-throughput  fluo¬ 
rescence  assay  for  UNG2  that  allows  screening  of  chem¬ 
ical  libraries.33  This  assay  was  used  to  screen  the  215 
oxime  mixtures  obtained  from  reaction  of  the  triskelion 
oxyamine  scaffold  with  uracil  5  and  library  aldehydes  6 
through  34.  Mixtures  that  showed  inhibitory  activity 
were  subjected  to  fractionation  using  reversed-phase 
HPLC,  and  then  the  individual  purified  components 
were  reevaluated  to  determine  which  species  was  respon¬ 
sible  for  the  inhibition.  After  identifying  the  inhibitory 
molecules,  they  were  resynthesized  in  larger  scale, 
purified,  and  complete  IC50  curves  were  determined  as 
previously  described.33 

Two  RCHO  groups  were  found  to  be  inhibitory  when 
attached  to  uracil  in  the  triskelion  scaffold  (Table  2). 
The  1C50  values  were  found  to  fall  in  the  range  ~0.9 
to  1 1  pM.  When  RCHO  =  3,4  dihydroxybenzaldehyde, 
both  the  RRU  (45)  and  UUR  (46)  variants  showed 
nearly  equal  activity,  suggesting  that  the  enzyme  recog¬ 
nizes  the  UR  element,  but  not  the  third  substituent  on 
the  scaffold  (U  or  R).  Consistent  with  this  interpreta¬ 
tion,  the  corresponding  bifunctional  oxime  (UR,  47, 
Table  2)33  showed  a  similar  IC50  value  as  45  and  46. 
In  contrast,  when  RCHO  =  3-carboxybenzaldehyde, 
the  trivalent  forms  48  and  49  were  found  to  be  6- 
to  12-fold  more  potent  than  the  bifunctional  oxime 
50.  Thus,  in  this  latter  case  the  third  substituent  has  a 
significant  effect  on  binding  affinity. 


2.3.  High-throughput  screening  of  triskelion  libraries 
against  dUTPase 

A  similar  strategy  was  used  to  screen  the  215  oxime  mix¬ 
tures  obtained  from  reaction  of  the  triskelion  scaffold 
with  uracil  aldehyde  4  and  library  aldehydes  6  through 
34  against  human  dUTPase.  In  this  case,  only  one  inhib¬ 
itory  RCHO  group  was  found  (Table  3),  and  both  the 
RRU  (51)  and  UUR  (52)  variants  provided  comparable 
IC50  values  in  the  range  3-5  pM.  To  the  best  of  our 
knowledge,  51  and  52  are  the  most  potent  nonnucleotide 
inhibitors  of  human  dUTPase  yet  reported. 

2.4.  Inhibition  by  the  untethered  parts 

The  key  question  in  determining  the  effectiveness  of  this 
tethering  strategy  is  the  inhibitory  capacities  of  the 
untethered  uracil  and  aldehyde  components.  For 
UNG2,  the  methyl  oxime  of  uracil  5  has  an  IC50  value 
of  75  pM,  and  no  inhibition  by  the  methyl  oxime  of  3,4 
dihydroxybenzaldehyde  (15)  could  be  detected  even  at 
concentrations  as  high  as  1  mM.  Thus,  tethering  15-5 
produced  an  increase  in  binding  affinity  of  75-fold  relative 
to  uracil  alone,  and  tethering  5-15  brought  about  at  least 
a  1000-fold  increase  in  binding  affinity  relative  to  15 
alone.  For  dUTPase,  the  methyl  oximes  of  4  and  8  were 
not  inhibitory  even  at  concentrations  as  high  as  1  mM. 
Thus  in  this  case,  tethering  of  the  two  parts  has  brought 
about  increases  in  binding  affinity  of  at  least  700-fold  as 
compared  to  the  separate  components.  A  trivial  but 
potentially  useful  modification  of  the  tethering  approach 
would  be  to  incorporate  two  different  R  groups  into  the 
triskelion  scaffold.  This  is  easily  accomplished  by  first 
synthesizing  and  isolating  the  monoderivatized  uracil 
compound  and  then  reacting  the  remaining  two  oxyamine 
positions  with  a  mixture  of  two  aldehydes  (unpublished). 


3.  Conclusion 

We  have  established  that  triskelion  libraries  of  uracil 
derivatives  can  efficiently  yield  inhibitors  with  micromo¬ 
lar  affinities  for  two  different  enzymes  that  recognize  the 
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Table  2.  Structures  and  inhibitory  constants  for  UNG2  inhibitors3 
Triskelion  compound  IC50  (pM) 


Table  3.  Structures  and  inhibitory  constants  for  dUTPase  inhibitors 
Triskelion  compound  IC50  (pM) 


46 


OH 


OH 


HNffNH 


47 


48 


49 


1.6  ±0.1 


0.9  ±  0.1 


uracil  base.  Unlike  previous  nucleic  acid-based  inhibitors 
of  UNG. 34,35  and  nucleotide-based  inhibitors  of  dUT¬ 
Pase,36  these  library  compounds  are  expected  to  be  cell 
permeable.  A  useful  extension  of  this  approach  is  current¬ 
ly  being  developed  where  the  length  of  the  linker  arms  is 
varied.  This  modified  approach  is  expected  to  generate 
more  diverse  libraries  that  allow  more  comprehensive 
probing  of  potential  binding  sites  near  the  uracil  pocket. 

With  respect  to  the  in  vivo  utility  of  such  oxime  libraries, 
there  are  a  number  of  currently  used  drugs  with  oxime 
functional  groups:  the  selective  serotonin  reuptake  inhib¬ 
itor,  fluvoxamine, 37,38  the  monobactam  antibiotic,  aztre- 
onam,38  and  several  preclinical  antimicrobial  drugs.39,40 
The  activity  of  these  drugs  indicates  that  oxime  linkages 
are  stable  and  useful  in  real  clinical  applications.  Never¬ 
theless,  oximes  are  susceptible  to  reduction  in  metabolic 
reactions  involving  cytochrome  P450-mediated  transfor¬ 
mations.40,41  Depending  on  the  pharmacokinetic  and 
pharmacodynamic  properties  of  the  individual  oximes, 
this  may  or  may  not  pose  a  problem.  For  instance  fluvox¬ 
amine,  although  extensively  processed  in  first-pass 
metabolism,  has  a  reasonable  serum  half-life  of 
12  h.37,38  We  anticipate  that  triskelion  libraries  based 
on  substrate  fragments  will  be  useful  for  rapid  inhibitor 
development  against  a  variety  of  enzymes. 


COOH 


,0nVt° 

HN^NH 

TT 


50 


10.9  ±0.5 


a  See  Ref.  33  for  representative  HTS  screening  data  and  IC50  curves. 


4.  Experimental 

4.1.  Reagents  and  general  methods 

All  chemicals  were  purchased  from  commercial  sources 
without  further  purification  unless  otherwise  stated. 
The  'H  and  13C  NMR  spectra  were  recorded  on  a 
400  MHz  Varian  Innova  instrument.  The  spectra  were 
recorded  in  deuteriochloroform  (CDC13)  or  in  hexadeu- 
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teriodimethylsulfoxide  (DMSO-76).  The  chemical  shifts 
of  protons  are  given  in  ppm  with  TMS  as  internal  stan¬ 
dard.  The  chemical  shifts  of  carbons  are  obtained  in 
ppm  with  solvents  as  internal  standards.  Oximes  were 
purified  by  HPLC  using  aqueous  triethylammonium 
acetate  (TEAA)  as  a  running  buffer.  Therefore,  TEAA 
was  not  completely  removed  and  it  appeared  in  the 
NMR  spectra.  Accordingly,  proton  and  carbon  chemi¬ 
cal  shifts  of  TEAA  were  not  listed  during  the  character¬ 
izations  of  the  oximes.  During  purification  of  the  oximes 
45  and  46,  2-mercaptoethanol  was  used  as  an  anti-oxi¬ 
dant.  Therefore,  small  amounts  of  this  compound  and 
its  oxidation  product  are  also  present  in  these  oximes. 
Flash  chromatographies  were  performed  with  silica 
(70-230  mesh  from  Sorbent  Technologies)  and  moni¬ 
tored  by  thin-layer  chromatography  (TLC)  with  silica 
plates  (Merck,  Kieselgel  60  F254). 

4.2.  2,2',2"-[(2-Methyloxy-l,3-propanedioxy)tris]-l//- 
isoindole-1,  3(2//)-dione  (2) 

To  a  suspension  of  2-hydroxymethyl- 1, 3-propanediol  1 
(0.848  g,  8.0  mmol),  triphenylphosphine  (7.32  g,  28  nnnol), 
and  N-  h yd  roxy  p h  t  h a  I  i  m  i dc  (6.52  g,  40  mmol)  in  anhy¬ 
drous  TEIF  (60  ml)  was  added  diisopropyl  azodicarboxy- 
late  (5.64  ml,  28.0  mmol)  dropwise  at  0°C.42~44  The 
mixture  was  stirred  overnight,  and  the  precipitate  was  fil¬ 
tered  and  washed  with  cold  THF.  After  removal  of  the 
THF  in  vacuo,  product  2  was  obtained  (2.3  g)  in  53% 
yield.  *H  NMR  (400  MHz,  CDC13):  <5  7.76  (m,  12H), 
4.66  (d,  7  =  6.4  Hz,  6H),  2.78  (septet,  7  =6.4  Hz,  1H); 
13C  NMR  (100  MHz,  CDC13):  S  163.43,  134.39,  128.97, 
123.47,  75.52,  37.68.  HRMS  (mlz):  [M+H]+  calcd  for 
C28H2oN309,  542.12;  found,  542.12. 

4.3.  2-[(Aminooxy)-methyl]-l,3-bis(aminooxy)-propane  (3) 

To  a  suspension  of  2  (2.43  g,  4.5  mmol)  in  95%  ethanol 
(9.5  ml),  was  added  anhydrous  hydrazine  (0.67  ml, 
20.2  mmol)  dropwise  within  10  min  at  room  temperature. 
The  mixture  was  stirred  for  2  h,  filtered,  and  washed  with 
95%  ethanol.  The  filtrate  was  concentrated  in  vacuo  to 
give  a  residue.  To  the  residue  was  added  methylene  chlo¬ 
ride  (10  ml).  The  resulting  mixture  was  kept  overnight  at 
room  temperature,  filtered  the  following  morning,  and 
the  filtrate  was  concentrated  in  vacuo,  giving  product  3 
(0.46  g)  in  67%  yield.  *H  NMR  (400  MHz,  CDC13):  S 
3.71  (d,  7=6.0  Hz,  6H),  2.41  (septet,  7=6.4  Hz,  1H); 
13C  NMR  (100  MHz,  CDC13):  <5  74.66,  37.05;  HRMS 
(mlz):  [M+H]+  calcd  for  C4H14N303,  152.10;  found, 
152.10. 

4.4.  General  synthesis  of  tethered  triskelion  oximes 

A  set  of  39  commercially  available  aldehydes  were 
linked  to  the  two  uracil-containing  aldehydes  4  and  5 
using  the  triskelion  alkyloxyamine  linker  3  (see  Scheme 

2  and  Table  1,  and  supplementary  materials).  To  each 
well  of  a  0.5-ml  Matrix  micro  titer  plate  were  added 
DMSO  stock  solutions  of  AcOH  (20  pL,  150  mM, 

3  pmol),  uracil-containing  aldehyde  4  or  5  (20  pL, 
150  mM,  3  pmol),  and  one  library  aldehyde  (40  pL, 
150  mM,  6  pmol).  The  synthesis  and  characterization 


of  4  and  5  has  been  previously  reported.  The  plate  was 
carefully  agitated  to  make  the  solutions  homogeneous, 
and  22  pL  of  a  DMSO  solution  of  the  triskelion  oxy- 
amine  was  then  added  (150  mM,  3.3  pmol).  The  plate 
was  sealed  and  further  agitated  and  incubated  in  an 
oven  for  12  h  at  37  °C.  The  expected  statistical  ratios 
of  the  oxime  products  were  confirmed  by  *H  NMR  anal¬ 
ysis  (see  supplementary  materials). 

4.5.  Isolation  and  purification  of  inhibitory  triskelion  oximes 

The  most  potent  inhibitors  were  resynthesized  in  larger 
scale  and  thoroughly  characterized  after  HPLC  purifica¬ 
tion  of  the  mixed  oximes  using  a  Phenomenex  Aqua 
reversed-phase  C-18  HPLC  column  (250  mm,  10  mm, 
5  pm).  Gradient  elution  from  0%  to  65%  CH3CN  in 
0.1  M  aqueous  TEAA  over  the  course  of  2  h  with  UV 
detection  at  254  nm  was  used.  An  exception  was  oxime 
47,  which  is  prone  to  air  oxidation.  In  this  case,  25  mM 
2-mercaptoethanol  was  added  to  both  of  the  running 
buffers.  The  oximes  all  eluted  with  baseline  resolution  in 
the  order  (1)  the  homotrimer  oxime  derived  from  4  or  5, 
(2)  the  heterotrimer  oxime  derived  from  4  or  5  and  either 
8,15  or  19,  and  (3)  the  homotrimer  oxime  derived  from  8, 
15  or  19. 

4.6.  2-[  0-(6-UracilcarboxaIdoximyl)-methyl]-l  ,3-bis[ O- 
(3,4-dihydroxybenzaldoximyl)]-propane  (45) 

'HNMR  (400  MHz,  DMSO-76):  <5  11.15  (brs,  1H),  9.30 
(br  s,  1H),  8.09  (s,  2H),  7.98  (s,  1H),  7.05  (d,  7  =  1.6  Hz, 
1H),  6.84  (dd,  7  =  8.0,  2.0  Hz,  1H),  6.74  (d,  7  =  8.0  Hz, 
1H),  5.78  (s,  1H),  4.32  (d,  7=  6.0  Hz,  2H),  4.17  (d, 
7=  5.6  Hz,  4H),  2.66  (m,  1H);  13C  NMR  (100  MHz, 
DMSO-76):  3  163.85,  151.04,  149.12,  147.70,  145.57, 
144.51,  142.43,  123.09,  119.95,  115.60,  112.91,  101.78, 
73.25,  71.13,  38.67;  HRMS  (mlz):  [M+H]+  calcd  for 
C23H24N509,  514.16;  found,  514.16. 

4.7.  2-[0-(3,4-Dihydroxybenzaldoximyl)-methyl]-l,3- 
bis|0-(6-  uracilcarboxaldoximyl))-propane  (46) 

*H  NMR  (400  MHz,  DMSO-76):  S  11.15  (br  s,  1H),  8.05 
(s,  1H),  7.97  (s,  2H),  7.04  (d,  7  =2.0  Hz,  1H),  6.84  (dd, 
7=8.0,  2.0  Hz,  1H),  6.74  (d,  7=  8.0  Hz,  1H),  5.78  (s, 
2H),  4.32  (d,  7=6.4  Hz,  4H),  4.16  (d,  7=6.0  Hz,  2H), 
2.66  (m,  1H);  13C  NMR  (100  MHz,  DMSO-76):  S 
164.50,  151.72,  149.84,  148.39,  146.24,  145.16,  143.13, 
123.68,  120.61,  116.25,  113.61,  102.40,  73.73,  71.60, 
38.98;  HRMS  (mlz):  [M+H]+  calcd  for  C21H22N709, 
516.15;  found,  516.15. 

4.8.  1-|  0-(6-Uracilcarboxaldoximyl)l-3-|  0-(3,4-dihy- 
droxybenzaldoximyl)]-propane  (47) 

The  synthesis  and  characterization  of  this  oxime  hetero¬ 
dimer  has  been  previously  described33. 

4.9.  2-[  0-(6-Uracilcarboxaldoximyl)-methyl|-l  ,3-bis[  O- 
(3-carboxybenzaldoximyl)]-propane  (48) 

'H  NMR  (400  MHz,  DMSO-76):  5  8.34  (s,  2H),  8.15  (s, 
2H),  8.0  (s,  1H),  7.91  (d,  7  =6.8  Hz,  2H),  7.65  (d, 
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/  =  8.0  Hz,  2H),  7.38  (t,  /=  8.4  Hz,  2H),  5.77  (s,  1H), 
4.36  (d,  /  =  6.4  Hz,  2H),  4.28  (d,  /=  5.6  Hz,  4H),  2.82 
(m,  1H);  13C  NMR  (100  MHz,  DMSO-t/6):  5  168.54, 
163.88,  151.07,  149.16,  144.57,  142.60,  137.36,  131.43, 
130.62,  128.75,  128.27,  127.55,  101.86,  73.12,  71.50, 
38.67;  HRMS  (mlz):  [M+H]+  calcd  for  C^H^NjOc,, 
538.16;  found,  538.16. 

4.10.  2-|0-(3-Carboxybenzaldoximyl)-methyl|-l,3-bis|0- 
(6-uracilcarboxaldoximyl)]-propane  (49) 

:H  NMR  (400  MHz,  DMSO-d6):  S  11.17  (br  s,  1H),  8.32 
(s,  1H),  8.16  (s,  1H),  8.0  (s,  2H),  7.92  (dd,  7=6.8, 
1.6  Hz,  1H),  7.65  (d,  7=8.0  Hz,  1H),  7.39  (t, 
7=  7.2  Hz,  1H),  5.79  (s,  2H),  4.33  (d,  7=  5.6  Hz,  4H), 
4.25  (d,  7  =5.6  Hz,  2H),  2.82  (m,  1H);  13C  NMR 
(100  MHz,  DMSO-76):  S  168.29,  163.85,  151.10, 
149.15,  144.54,  142.46,  131.33,  130.67,  128.22,  127.58, 
101.70,  72.99,  71.39,  38.56;  HRMS  (mlz):  [M+H]+  calcd 
for  C22H22N709,  528.15;  found,  528.15. 

4.11.  l-[0-(6-Uracilcarboxaldoximyl)]-3-|0-(3-carbo- 
xybenzaldoximyl)]-propane  (50) 

The  synthesis  and  characterization  of  this  oxime  hetero¬ 
dimer  has  been  previously  described33. 

4.12.  2-{ 0-|2-(N  1  -uracil)-acetaldoximyl]-methyl}-l,3- 
bis|0-(3,4-dihydroxy-5-methoxybenzaldoximyl)|-propane 
(51) 

'H  NMR  (400  MHz,  DMSO-76)  S  11.32  (d,  2H),  9.14  (s, 
1H),  8.72  (s,  1H),  8.03  (q,  1H),  7.65  (dd,  1H),  7.56  (d, 
1H),  7.52  (dd,  1H),  6.92  (dd,  1H),  6.69  (s.  1H),  5.55 
(d,  2H),  4.50  (dd,  2H),  4.43  (t,  2H),  4.17-4.00  (m,  6H), 
3.70  (s,  3H),  1.28  (m,  1H);  13C  NMR  (400  MHz, 
DMSO-76)  S  (164.5,  164.4),  153.2,  (151.8,  151.5), 
(150.8,  150.0),  (149.1,  147.2),  (146.5,  146.3),  143.5, 

137.1,  122.7,  108.6,  103.2,  (101.9,  101.8),  72.4,  72.0, 
59.9,  (56.9,  56.5),  (46.5,  45.2);  HRMS  (mlz):  [M+H]+ 
calcd  for  C24H28N7O10  574.1892,  found  574.1885. 

4.13.2-|  0-(3,4-Dihydroxy-5-methoxybenzaldoximyl)-meth- 
yl|-l,3-bis{0-|2-(Nl-uracil)-acetaldoximyl)}-propane  (52) 

:H  NMR  (400  MHz,  DMSO-76)  S  11.33  (d,  1H),  9.14  (s, 
2H),  8.72  (s,  2H),  8.03  (d,  2H),  7.65  (d,  0.5H),  7.56  (d, 
0.5H),  7.53  (t,  0.5H),  6.93  (t,  0.5H),  6.69  (s,  2H),  5.55 
(d,  1H),  4.50  (d,  1H),  4.43  (d.  1H),  4.42-4.00  (m,  6H), 
3.70  (s,  6H),  1.24  (m,  1H);  13C  NMR  (400  MHz, 
DMSO-76)  S  (164.5,  164.4),  153.2,  (151.8,  151.5), 
(150.8,  150.0),  (149.1,  147.2),  (146.5,  146.3),  143.5, 

137.1,  122.7,  108.7,  103.1,  (101.9,  101.8),  72.4,  71.9, 
59.9,  (56.9,  56.4),  (46.5,  45.2);  HRMS  (mlz):  [M+H]+ 
calcd  for  C^oNjOn  588.1936,  found  588.1920. 


5.  In  vitro  inhibition  studies 

5.1.  High-throughput  inhibitor  screening  of  UDG 

The  molecular  beacon-based  HTS  assay  for  UDG  has 
been  previously  described.33  Briefly,  to  a  96-well  micro¬ 


titer  plate  was  added  5  pL  of  2  mM  total  compound  in 
DMSO,  followed  by  75  pL  33.3  pM  human  UNG  in 
reaction  buffer  (10  mM  Tris-HCl,  pH  8.0,  20  mM  NaCl, 
7.5  mM  MgCl2,  0.002%  brij-35).  The  reactions  were  ini¬ 
tiated  by  the  addition  of  20  pL  of  250  nM  molecular 
beacon  substrate  in  reaction  buffer.  The  plates  are  incu¬ 
bated  at  ambient  temperature  in  a  fluorescence  plate 
reader  for  30  min,  and  the  progress  of  the  reaction  was 
monitored  every  5  min  (Ex.  485  nm/Em.  520  nm).  The  fi¬ 
nal  concentrations  of  the  reagents  in  the  assay  are 
10  mM  Tris-HCl,  pH  8.0,  20  mM  NaCl,  7.5  mM 
MgCl2,  0.002%  Brij-35,  25  or  100  pM  human  UNG, 
50  nM  molecular  beacon  substrate,  100  pM  total  com¬ 
pound,  and  5%  DMSO.  IC50  analysis  was  performed 
using  the  same  conditions  except  that  the  concentration 
of  compound  was  varied  in  the  range  0.01-100  pM. 

5.2.  High-throughput  inhibitor  screening  of  dUTPase 

To  a  96-well  micro  titer  plate  were  added  20  pL  of  reaction 
buffer  (50  mM  Tris,  pH  8.0,  10  mM  MgCl2,  and  0.05% 
Tween  20)  and  5  pL  of  compounds  (2  mM)  in  DMSO,  fol¬ 
lowed  by  50  pL  of  dUTPase  (1  nM)  and  pyrophosphatase 
(10  U/mL)  in  reaction  buffer.  The  reactions  were  initiated 
by  the  addition  of  25  pL  dUTP  (40  pM).  The  plates  are 
incubated  at  ambient  temperature  for  50  min  and  the 
reactions  were  quenched  by  25  pL  of  malachite  green 
color  reagent.45  The  mixtures  were  then  allowed  to  stand 
for  10  min  and  the  absorbances  were  measured  using  a 
microtiter  plate  reader  with  a  620  nm  bandpass  filter. 
IC50  analysis  was  performed  using  the  same  conditions 
except  that  the  concentration  of  compound  was  varied 
in  the  range  0.1-75  pM. 
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ABSTRACT 

Human  nuclear  uracil  DNA  glycosylase  (UNG2)  is  a 
cellular  DNA  repair  enzyme  that  is  essential  for  a 
number  of  diverse  biological  phenomena  ranging 
from  antibody  diversification  to  B-cell  lymphomas 
and  type-1  human  immunodeficiency  virus  infectiv- 
ity.  During  each  of  these  processes,  UNG2  recog¬ 
nizes  uracilated  DNA  and  excises  the  uracil  base  by 
flipping  it  into  the  enzyme  active  site.  We  have  taken 
advantage  of  the  extrahelical  uracil  recognition 
mechanism  to  build  large  small-molecule  libraries 
in  which  uracil  is  tethered  via  flexible  alkane  linkers 
to  a  collection  of  secondary  binding  elements.  This 
high-throughput  synthesis  and  screening  approach 
produced  two  novel  uracil-tethered  inhibitors  of 
UNG2,  the  best  of  which  was  crystallized  with  the 
enzyme.  Remarkably,  this  inhibitor  mimics  the 
crucial  hydrogen  bonding  and  electrostatic  interac¬ 
tions  previously  observed  in  UNG2  complexes  with 
damaged  uracilated  DNA.  Thus,  the  environment  of 
the  binding  site  selects  for  library  ligands  that  share 
these  DNA  features.  This  is  a  general  approach  to 
rapid  discovery  of  inhibitors  of  enzymes  that 
recognize  extrahelical  damaged  bases. 


INTRODUCTION 

The  RNA  base  uracil  is  one  of  the  most  prevalent  non- 
canonical  bases  found  in  genomic  DNA  (1).  It  arises  from 
spontaneous  or  intentional  enzymatic  deamination  of  cytosine 
in  DNA  (2-5),  or  alternatively,  by  misincorporation  of  dUTP 
in  place  of  TTP  during  DNA  replication  (6).  Both  pathways 
for  uracil  incorporation  are  forms  of  DNA  damage,  and 
accordingly,  an  elaborate  uracil  base  excision  repair 
(UBER)  mechanism  is  present  in  all  organisms  to  reverse 
this  damage  (Figure  1  A)  (7).  Without  repair,  U/G  mismatches 
lead  to  T/A  transition  mutations  and  corresponding  changes 
in  protein  sequence.  Although  U/A  base  pairs  arising  from 


misincorporation  of  dUTP  are  not  mutagenic,  if  large  num¬ 
bers  of  uracils  are  inserted  on  both  strands  of  replicated 
DNA  this  can  lead  to  disruptions  in  gene  expression,  and 
even  double  strand  DNA  breaks  can  arise  from  the  base  exci¬ 
sion  repair  process  (8).  Although  the  accidental  appearance  of 
uracil  in  DNA  is  well-appreciated,  it  has  become  apparent 
that  enzymatic  deamination  of  cytosine  to  uracil  in  DNA 
plays  a  key  role  in  the  processes  of  somatic  hypermutation 
and  class  switch  recombination  in  B  cells  (2,9,10),  in  certain 
B  cell  lymphomas  (11),  and  as  an  innate  host  defense  mecha¬ 
nism  against  retroviral  infection  (12).  In  addition,  the  widely 
used  chemotherapeutic  agent  5-fluorouracil  (5-FU)  promotes 
uracil  misincorporation  into  DNA  by  increasing  the  ratio 
[dUTP]/[TTP]  in  the  cell,  suggesting  inhibitors  of  UBER 
could  serve  as  sensitizers  during  5-FU  therapy  (8,13).  In  gen¬ 
eral,  these  diverse  roles  for  uracil  indicate  that  small  molecule 
UBER  inhibitors  might  be  very  useful  investigational  or 
therapeutic  agents. 

In  humans,  the  first  step  in  the  UBER  pathway,  cleavage  of 
the  glycosidic  bond  of  deoxyuridine  in  DNA,  is  catalyzed  by 
the  powerful  nuclear  enzyme  uracil  DNA  glycosylase 
(UNG2)  (14).  This  extensively  studied  glycosylase  uses  an 
extrahelical  recognition  mechanism  in  which  the  uracil  base 
that  is  originally  embedded  in  the  DNA  base  stack  is  ulti¬ 
mately  extruded  into  the  enzyme  active  site  (Figure  IB) 
(15).  Thus  by  ‘uracil  flipping’  the  enzyme  can  recognize 
the  unique  structural  features  of  uracil  that  allows  catalysis 
to  proceed  (7).  Although  the  uracil  is  attached  to  a  large 
duplex  DNA  substrate,  most  of  the  enzyme-substrate  interac¬ 
tions  involve  the  base  itself.  Perhaps  not  surprisingly,  the 
uracil  base  alone  has  been  found  to  be  a  product  inhibitor 
of  the  enzyme  (K-,  ~  300  pM  at  physiological  pH)  (16). 

We  recently  published  a  general  strategy  for  rapid  discov¬ 
ery  of  small  molecule  inhibitors  of  UNG2  and  other  UBER 
enzymes  called  ‘substrate  fragment  tethering’  (SFT)  (17), 
which  is  an  efficient  variation  of  the  combinatorial  target- 
guided  ligand  assembly  method  of  Ellman  et  al.  (18).  The 
basic  approach  is  extremely  simple  and  involves  tethering  a 
chemical  library  of  aldehydes  to  pieces  of  substrates  (such 
as  uracil)  that  already  bind  weakly  to  an  enzyme  active 
site.  As  shown  in  Figure  2,  UNG2  library  synthesis  involves 
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Figure  1.  Uracil  DNA  base  excision  repair  and  extrahelical  recognition  of 
uracil.  (A)  Uracil  in  the  context  of  a  U/A  or  U/G  base  pair  is  repaired  by  a 
series  of  enzymatic  reactions  that  restore  the  integrity  of  the  DNA  sequence. 
The  first  enzyme  in  the  pathway  is  uracil  DNA  glycosylase  (UNG)  that 
hydrolytically  cleaves  the  A-glycosidic  bond  connecting  the  uracil  to  the 
deoxyribose,  leaving  an  abasic  site  and  free  uracil.  Three  other  enzymes 
complete  the  repair  process  in  humans:  an  abasic  site  endonuclease  (APE1),  a 
dual-function  repair  polymerase  (pol  (3)  that  inserts  the  correct  nucleotide  and 
eliminates  the  abasic  site  via  a  p-elimination  reaction,  and  finally,  DNA 
ligase.  (B)  Structure  of  uracilated  DNA  bound  to  human  UNG2  (PDB  code 
1EMH). 


Figure  2.  Chemistry  of  substrate  fragment  tethering.  In  this  approach,  a 
suitable  substrate  fragment  (such  as  uracil)  is  identified  and  derivatized  at  a 
nonperturbing  position  with  an  aldehyde  functional  group.  In  the  case  of 
UNG2,  the  substrate  fragment  ii  6-formyluracil.  The  substrate  fragment  is 
tethered  to  one  end  of  a  bifunctional  alkyloxyamine  linker  of  variable  length 
(n  —  2-6),  which  is  then  derivatized  on  the  other  end  with  a  library  of  aldehyde 
binding  elements  (RCHO).  Although  statistical  mixtures  of  all  possible  oximes 
result  (25%  each  homodimer  derived  from  uracil-CHO  or  RCHO,  and  50%  of 
the  heterodimer  derived  from  uracil-CHO  and  RCHO),  this  poses  no  difficulty 
because  the  crude  mixtures  are  directly  screened  for  inhibitory  activity.  Once 
active  mixtures  are  identified,  the  specific  inhibitory  components  can  be 
rapidly  identified  by  deconvolution  to  uncover  the  linker  length  that  gave  rise 
to  the  observed  inhibition.  The  structures  of  the  215  aldehyde  library  members 
used  in  this  study  are  reported  in  Supplementary  Table  SI. 

highly  efficient  formation  of  oxime  linkages  between  a  biva¬ 
lent  alkyloxyamine  linker,  a  uracil  aldehyde  derivative,  and 
each  library  aldehyde  member.  Thus,  the  uracil  fragment  tar¬ 
gets  the  entire  tethered  molecule  to  the  active  site  where  the 


library  pieces  can  then  explore  adjacent  binding  pockets.  SFT 
has  the  following  strengths:  (i)  library  synthesis  is  economi¬ 
cal  and  very  rapid  and  can  be  performed  in  microtiter  plate 
format,  (ii)  the  reactions  are  extremely  efficient  and  no  puri¬ 
fication  of  any  products  is  required,  (iii)  mixtures  of  flexible 
linkers  are  used  in  each  reaction  which  allows  multiple  teth¬ 
ering  lengths  to  be  probed  simultaneously  in  activity  screens, 
and  (iv)  the  method  is  easily  adaptable  to  any  desired  target. 

Here  we  report  the  results  from  high-throughput  screening 
(HTS)  of  an  SFT  library  derived  from  tethering  6-formyl 
uracil  to  a  library  of  215  different  aldehyde-binding  elements. 
Of  the  two  hits  identified  in  this  screen,  the  most  potent  SFT 
ligand  was  co-crystallized  with  UNG2  to  yield  a  high- 
resolution  structure  of  the  complex.  This  first  portrait  of  a 
bound  SFT  ligand  shows  how  a  small  molecule  can  surpris¬ 
ingly  mimic  the  hydrogen  bonding  and  electrostatic  interac¬ 
tions  of  a  larger  DNA  substrate.  Thus,  the  environment  of 
the  binding  site  appears  to  select  for  library  ligands  that 
share  molecular  features  of  DNA.  This  efficient  approach 
should  be  easily  adaptable  to  other  DNA  repair  glycosylases 
that  recognize  extrahelical  bases. 

MATERIALS  AND  METHODS 

Reagents  and  general  methods 

All  chemicals  were  purchased  from  commercial  sources  with¬ 
out  further  purification  unless  otherwise  stated.  The  'H  NMR 
spectra  were  recorded  on  a  400  MHz  Varian  Innova  instrument 
in  hexadeuteriodimethylsulfoxide  (DMSO-d6).  The  chemical 
shifts  of  protons  are  given  in  p.p.m.  with  DMSO  as  an  internal 
standard. 

Synthesis  of  tethered  oxime  libraries 

The  6-formyluracil  tethered  library  was  synthesized  as 
described  previously  (17).  Briefly,  a  set  of  215  alkyl  and 
aryl  aldehydes  (Supplementary  Table  SI)  was  selected  for 
library  synthesis  and  coupled  to  6-formyluracil  using  the 
O.O'-diaminoalkanediol  linkers  as  follows.  To  each  0.5  ml 
well  of  a  Matrix  microtiter  plate  was  added  a  DMSO  stock 
solution  of  AcOH  (20  pi,  150  mM,  3  pmol),  6-formyluracil 
(20  pi,  150  mM,  3  pmol)  and  a  single  alkyl  or  aryl  aldehyde 
(20  pi,  150  mM,  3  pmol).  The  plate  was  carefully  agitated  to 
make  the  solutions  homogenous.  To  each  of  the  mixtures  was 
added  a  DMSO  solution  of  the  O ,  O '  -  di  amino  alkane  dio  1  link¬ 
ers  containing  each  of  the  five  linker  lengths  in  equal  propor¬ 
tion  (22  pi,  150  mM,  3.3  pmol  total  amine  equivalents).  The 
plate  was  sealed,  further  agitated  and  incubated  in  an  oven 
for  12  h  at  37°C. 

Deconvolution  of  inhibitory  mixtures 

The  two  active  mixtures  containing  compounds  1  and  2  were 
deconvoluted  with  respect  to  linker  length  by  individually 
synthesizing  each  oxime  dimer  using  a  single  diaminoalkane- 
diol  linker  per  reaction.  At  this  stage  we  did  not  separate 
the  homodimers  from  the  heterodimers  in  the  mixtures.  The 
linker  length  dependence  of  the  inhibition  is  reported  in  Sup¬ 
plementary  Table  S2.  The  corresponding  compounds  were 
then  synthesized  in  larger  scale  and  purified  for  complete 
analysis  of  their  inhibition  properties  as  described  below. 


5874  Nucleic  Acids  Research,  2006,  Vol.  34,  No.  20 


Synthesis  of  1 

Solutions  (0.15  M)  of  4-carboxybenzaldehyde  (888  pi, 
0.165  mmol),  6-formyluracil  (888  pi,  0.165  mmol)  and  acetic 
acid  (888  jj.1,  0.165  mmol)  in  DMSO  were  added  to  a  reaction 
vessel.  The  reaction  was  initiated  by  the  addition  of  0.15  M 
0,0'-diaminoethanediol  (888  )_il,  0.165  mmol)  in  DMSO,  and 
incubated  at  37°C  for  36  h.  The  desired  heterosubstituted  com¬ 
pound  was  purified  by  direct  injection  of  the  reaction  mixture 
onto  a  Phenomenex  Aqua  reversed  phase  C- 1 8  HPLC  column 
(250  mm,  10  mm,  5  (tm)  using  gradient  elution  from  0  to  65% 
CH3CN  in  0. 1  M  aqueous  TEAA  over  the  course  of  2  h  using 
UV  detection  at  320  nm.  Fractions  containing  1  were  combined 
and  concentrated  in  vacuo.  The  compound  was  precipitated 
using  ice-cold  water,  centrifuged,  washed  twice  with  ice-cold 
water  and  dried  in  vacuo.  This  yielded  1  as  a  white  powder 
(9.6  mg,  0.028  mmol)  in  34%  yield.  'H  NMR  (400  MHz, 
DMSO-d6):  8  11.19  (s,  1H),  10.81  (s,  1H),  8.37  (s,  1H),  7.98 
(s,  1H),  7.96  (d,  3H),  7.72  (d,  2H),  5.79  (s,  1H),  4.66  (d,  4H); 
13C  NMR  (400  MHz,  DMSO-d6):  8  167.6,  163.9,  151.0, 
148.7,  144.5,  142.5,  129.8,  126.9,  102.0,  73.2,  71.9;  (m/z): 
[M+H]+  calcd  for  C15H14N406,  347.0986;  found,  347.0991. 

Synthesis  of  2 

The  synthesis  and  purification  was  identical  to  that  re¬ 
ported  above  for  1  except  that  0.15  M  solutions  of 
3-carboxybenzaldehyde  and  O/^-di  ami  nopropanediol  were 
used.  *H  NMR  (400  MHz,  DMSO-d6):  8  11.16  (s,  1H), 
10.78  (s,  1H),  8.34  (s,  1H),  8.18  (s,  1H),  7.94  (d,  2H),  7.83 
(d,  2H),  7.53  (t,  1H),  5.76  (s,  1H),  4.27  (t,  2H),  4.20  (t,  2H), 
2.10  (m,  2H);  13C  NMR  (400  MHz,  DMSO-d6):  8  166.9, 
163.9,  151.0,  148.3,  144.6,  142.2,  132.5,  131.5,  130.9, 
130.5,  129.2,  127.6,  101.7,  71.8,  70.5,  28.4;  (m/z):  [M+H]+ 
calcd  for  C15H14N406,  361.1143;  found,  361.1153. 

High-throughput  screening  of  oxime  library 

The  DNA  substrate  in  this  HTS  assay  was  synthesized  using 
standard  phosphoramidite  DNA  solid-phase  chemistry  using 
reagents  purchased  from  Glen  Research.  The  DNA  was  puri¬ 
fied  using  anion  exchange  chromatography  followed  by 
desalting  using  reversed  phase  methods.  The  sequence  and 
size  was  confirmed  using  analytical  denaturing  PAGE  and 
MALDI-MS.  The  substrate  is  a  single-stranded  28mer  DNA 
hairpin  containing  nine  U-A  base  pairs  and  a  hexapolyethy- 
lene  glycol  (PEG6)  linker  (5'-FAM-GCA  CUU  AAG  AAU 
UG-PEG6-CA  AUU  CUU  AAG  UGC-DABSYL-3').  The 
UNG2  HTS  assay  has  been  described  previously  (17). 

ICSo  determinations 

To  a  96-well  plate  was  added  5  pi  compound  1  in  DMSO,  fol¬ 
lowed  by  75  pi  of  66.5  nM  PEG-U9  hairpin  in  reaction  buffer 
(20  mM  Tris-HCl,  pH  8.0,  50  mM  KC1,  0.2  mM  MgCl2  and 
0.05%  Brij-35).  Eight  different  inhibitor  concentrations  were 
used  in  the  range  of  0.045-100  pM.  Reactions  were  initiated 
by  the  addition  of  20  pi  of  0.5  nM  human  UNG  in  reaction 
buffer.  The  final  concentrations  of  reagents  in  the  assay  are 
20  mM  Tris-HCl,  pH  8.0,  50  mM  KC1,  0.2  mM  MgCl2, 
0.05%  Brij-35,  5%  DMSO,  0.1  nM  human  UNG,  50  nM 
PEG-U9  hairpin  DNA  and  0-100  pM  1.  Wells  containing 
DMSO  vehicle  only  or  no  UNG2  were  used  as  negative  con¬ 
trols  and  background,  respectively.  The  plates  are  incubated 


at  ambient  temperature  in  a  fluorescence  plate  reader  for 
30  min,  and  the  progress  of  the  reaction  was  monitored 
every  5  min  (Ex.  485  nm/Em.  520  nm).  Percent  inhibition 
versus  log  concentration  of  1  data  were  fit  to  a  four  parameter 
sigmoidal  dose-response  equation  (Equation  1)  using  Prism 
4.0  (GraphPad  Software  Inc.,  San  Diego,  CA,  USA). 


%  Inhibition 


Min(Max  —  Min) 

1  _|_  io^logIC50~logW)*")  ’ 
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Mechanism  of  inhibition 

To  a  96-well  plate  was  added  5  pi  compound  in  DMSO,  fol¬ 
lowed  by  75  pi  PEG-U9  hairpin  in  reaction  buffer  (20  mM 
Tris-HCl,  pH  8.0,  50  mM  KC1,  0.2  mM  MgCU  and  0.05% 
Brij-35).  Eight  different  DNA  concentrations  were  used  in 
the  range  of  27.5-1100  nM.  Reactions  were  initiated  by  the 
addition  of  20  pi  of  0.5  nM  human  UNG  in  reaction  buffer. 
The  final  concentrations  of  reagents  in  the  assay  are  20  mM 
Tris-HCl,  pH  8.0,  50  mM  KC1,  0.2  mM  MgCl2,  0.05% 
Brij-35,  5%  DMSO,  0.1  nM  human  UNG,  27.5-1100  nM 
PEG-U9  hairpin  DNA  and  0-128  pM  of  1.  The  plates  were 
incubated  at  ambient  temperature  in  a  fluorescence  plate 
reader  for  60  min,  and  the  progress  of  each  reaction  was 
monitored  every  30  s  (Xcx  =  485  nm,  A.em  =  520  nm).  After¬ 
wards,  Escherichia  coli  UNG  was  added  to  each  well  to  drive 
the  reactions  to  completion,  and  the  total  change  in  fluores¬ 
cence  corresponding  to  complete  consumption  of  the  sub¬ 
strate  was  calculated  (AFUtot).  These  values  were  used  to 
calculate  initial  molar  velocities  (i.e.  [product]/s  =  ApM/ 
AFUtot  x  FU/s).  Mechanisms  of  inhibition  and  their  corre¬ 
sponding  inhibitor  dissociation  constants  were  determined 
by  Lineweaver-Burk  slope  and  intercept  replot  analysis. 

Cell  culture  studies 

Currently,  there  exists  no  simple  assay  to  monitor  the  efficacy 
of  inhibitors  of  UNG  within  living  cells.  In  order  to  evaluate 
the  ability  of  compound  1  to  inhibit  UNG  in  vivo,  inhibitor 
was  added  to  cells  in  culture,  lysates  were  carefully  prepared 
to  minimize  dilution  of  the  inhibitor,  and  UNG  activity  was 
assayed  as  follows  (Figure  3).  PC-3  human  prostate  adenocar¬ 
cinoma  cells  were  grown  in  Modified  Eagle’s  Medium 
(Mediatech)  supplemented  with  10%  fetal  bovine  serum 
(Hyclone)  and  penicillin/streptomycin.  Cells  at  70%  conflu- 
ency  were  trypsinized,  resuspended  in  growth  media  and 
counted  using  a  hemocytometer.  The  cells  were  then  centri¬ 
fuged  and  resuspended  to  a  density  of  1  x  106  cells/ml. 
Aliquots  of  1  x  106  cells  were  pelleted  and  resuspended  in 
46.5  pi  growth  media  with  or  without  3  mM  compound  1 
to  achieve  a  final  concentration  of  3  mM  inhibitor.  The 
cells  were  incubated  at  37  °C,  5%  C02  for  30  min  before 
being  spun  down  and  resuspended  in  7  pi  UNG  lysate  reac¬ 
tion  buffer  [10  mM  Tris-HCl,  pH  8.0,  60  mM  NaCl,  1  mM 
DTT,  1  mM  EDTA,  0.002%  Brij  35,  protease  inhibitors 
(Roche  Complete,  Mini)].  The  cells  were  lysed  by  freeze¬ 
thawing  five  times,  the  lysates  were  centrifuged  at 
14000  r.p.m.  for  30  min  and  the  supernatant  was  transferred 
to  a  fresh  tube,  and  the  volume  was  now  increased  by  3.5  pi 
based  on  the  number  of  cells  pelleted  (106)  and  the  average 
cell  volume  of  3.5  pi  for  PC-3  cells  (19).  Thus,  it  is  calculated 
that  3.5  pi  x  3  mM  (1)  =  10  nmol  of  compound  1  was  trapped 
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Figure  3.  Assay  for  small  molecule  inhibition  of  UNG  in  cells. 


in  the  pelleted  cells.  The  UNG  activity  was  measured  using 
the  PEG-U9  hairpin  substrate  and  2  |il  lysate  in  a  total  reac¬ 
tion  volume  of  150  pi.  Assuming  equilibration  of  1  across  the 
cell  membrane,  its  final  concentration  in  the  fluorescence 
reaction  was  13  pM.  Initial  reaction  velocities  were  calcu¬ 
lated  using  the  total  change  in  fluorescence  as  described 
(1)  and  normalized  to  protein  content  measured  using  the 
BioRad  protein  assay. 

Crystallization  of  the  complex  of  UNG2  and  3-(2)-A8 

Human  UNG2  was  expressed  and  purified  as  described 
previously  (20).  A  solution  of  human  UNG2  (112.5  pi, 
44.2  mg/ml)  in  a  buffer  containing  50  mM  Tris-OAc, 
pH  7.0,  150  mM  NaCl  and  1  mM  DTT,  was  mixed  with  1 
(12.5  |il,  16.8  mM)  in  100  mM  Tris-HCl,  pH  8.0  and  5% 
DMSO.  The  mixture  was  allowed  to  incubate  at  ambient 
temperature  for  30  min,  and  then  centrifuged  at  10000X  g 
for  5  min.  Co-crystallization  conditions  were  screened 
using  the  Nextal  PEG  Suite  library.  A  total  of  300  nl  of  the 
complex  was  mixed  with  an  equal  volume  of  precipitant, 
and  allowed  to  crystallize  at  22°  C  using  the  hanging  drop 
method.  Crystals  were  observed  within  48  h  with  0.2  M  pot¬ 
assium  thiocyanate,  20%  PEG  3350.  X-ray  diffraction  data 
were  collected  from  a  flash  frozen  crystal  in  its  unmodified 
mother  liquor  at  the  National  Synchrotron  Light  source  at 
Brookhaven  National  Laboratory  (beam  line  X6A)  using  a 
wavelength  of  1.1  A  with  a  ADSC  CCD  detector  Quantum- 
4.  The  package  HKL2000  (21)  was  used  for  data  reduction. 

The  structure  was  determined  by  molecular  replacement 
with  the  program  MOLREP  using  the  uncomplexed  UNG2 
structure  (1KHZ)  as  the  searching  model.  After  an  initial 
rigid-body  refinement,  compound  1  was  placed  in  a  difference 
Fourier  electron  density.  The  final  model  of  the  UNG2-1 
complex,  refined  using  REFMAC5  (22)  with  isotropic  tem¬ 
perature  factors,  shows  all  non-glycine  residues  in  allowed 


regions  of  the  Ramachandran  plot  and  excellent  stereochem¬ 
istry  (Table  1).  Riding  hydrogens  of  protein  atoms  were 
used  in  REFMAC5.  The  structural  statistics  are  reported 
in  Table  1. 


RESULTS  AND  DISCUSSION 

We  previously  used  SFT  to  identify  two  uracil-tethered 
inhibitors  of  UNG2  from  a  small  aldehyde  library  containing 
only  14  different  aldehyde  binding  elements  (17).  Although 
these  SFT  compounds  had  K,  values  between  0.3  and 
3  pM,  the  non-uracil  binding  elements  were  derived  from 
unstable  di-  or  trihydroxybenzaldehydes.  Thus,  these  initial 
compounds  were  prone  to  air  oxidation  to  the  inactive  qui- 
none  forms,  and  were  not  suitable  for  structural  or  cell  culture 
studies.  We  therefore  turned  to  screening  a  much  larger 
215  member  aldehyde  library  using  our  high-throughput 
molecular  beacon  fluorescence  assay  (see  Supplementary 
Data)  (17).  This  screening  effort  resulted  in  the  identification 
of  two  new  active  mixtures  (Figure  4).  After  deconvolution 
to  identify  the  linker  length  that  gave  rise  to  inhibition,  and 
purification  of  the  individual  inhibitory  compounds,  the  IC50 
values  were  determined  (Figure  5A). 

The  library  binding  elements  that  gave  rise  to  the  observed 
inhibition  shared  a  common  chemical  structure  The  two  most 
potent  compounds  1  and  2,  with  IC50  values  of  9  and  1 1  pM, 
respectively,  both  shared  formate- substituted  benzaldehyde 
functional  groups  and  short  alkyl  chain  linker  lengths  of 
n  =  2  or  3  (Figure  4).  These  structure-activity  trends  sug¬ 
gested  the  presence  of  a  binding  pocket  directly  adjacent  to 
the  uracil  binding  site  that  depends  on  positioning  of  the 
negatively  charged  formate  groups  of  either  1  or  2.  To  date, 
tethering  library  binding  elements  to  the  6-formyl  uracil 
substrate  fragment  has  brought  about  increases  in  binding 
affinity  of  up  to  —3  kcal/mol  as  compared  to  the  O-methyl 
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Table  1.  Crystallographic  data  collection  and  refinement  statistics 


Space  group 

P212121 

1  and  c  =  70.4  A 

Cell  dimensions 

a  =  43.2,  b  =  69. 

Resolution  range  (A) 

49.3-1.3 

/?Sym  (Last  shell)3 

0.06  (0.49) 

Completeness  (Last  shell) 

98.0  (92.4)% 

Multiplicity  (Last  shell) 

4.9  (4.2) 

Had)  (Last  shell) 

10.2  (2.6) 

Number  of  reflections 

51470 

Refinement 

F  Data  cutoff  in  o  (F)  units 

0 

Number  of  atoms 

Average  B  (A2) 

Protein 

1808 

11.1 

Solvent 

315 

21.0 

Ligand 

25 

13.8 

Total 

2148 

12.6 

R-v  alue 

0.18 

/(free  (test  Set  of  5%) 

0.21 

Stereochemical  constraints 

Bond  length  rms  (A) 

0.007 

Bond  angles  rms  (degrees) 

1.26 

Improper  angles  rms  (degrees) 

0.07 

aRSym  =  T,h  J2j  I hj  -  (4)1/  J2h  T.J  \hj\,  where  h  represents  a  unique  reflec- 
tion  and  j  means  symmetry  equivalent  indices,  I  is  the  observed  intensity,  and 
</>  is  the  mean  value  of  I. 


Figure  4.  Structure  of  inhibitory  compounds  identified  from  high-throughput 
screening.  IC50  values  were  determined  for  each  purified  compound  and  a  full 
mode-of-inhibition  analysis  and  structural  characterization  was  performed  for 
compound  1.  The  IC50  value  for  compound  1  is  equivalent  to  its  true  K,. 


oxime  derived  from  6-formyl  uracil  and  methoxyamine 
(Ki  =  45  |dM)  (17). 

Since  1  showed  the  highest  activity  we  investigated  its 
mode  of  inhibition  in  greater  detail.  Interestingly,  most  of 
our  previously  reported  SFT  inhibitors,  as  well  as  the  uracil 
base  itself,  showed  surprisingly  complex  modes  of  inhibition 
with  either  competitive  or  partial  uncompetitive  binding  to 


A. 


1/[DNA]  (nM1) 


Figure  5.  Inhibition  by  1.  (A)  Concentration  dependence  of  inhibition.  The 
curve  is  a  nonlinear  least-squares  fit  to  Equation  1  (IC50  =  9  ±  1  (iM). 
(B)  Mode-of-inhibition  analysis  for  compound  1.  Linear  competitive 
inhibition  was  observed:  Kx  =  6  ±  1  (iM. 


two  distinct  uracil  binding  sites  (17).  This  surprising 
complexity,  which  is  not  entirely  understood  at  a  structural 
level,  was  attributed  to  the  presence  of  a  second  weak  uracil 
binding  site  that  may  be  occupied  transiently  during  the 
process  of  uracil  flipping  into  the  active  site.  In  contrast 
with  the  previous  complexity,  the  inhibition  patterns  for 
1  indicated  simple  competitive  inhibition  (Figure  5B),  with 
linear  Lineweaver-Burk  slope  replots  (data  not  shown). 
SFT  inhibitor  1  also  showed  an  IC50  value  against  full-length 
UNG2  that  was  only  25%  greater  than  the  catalytic  domain 
(the  full  length  UNG2  was  assayed  using  cell  extracts).  The 
full-length  UNG2  protein  differs  from  the  catalytic  domain 
by  a  90  amino  acid  N-terminal  extension  that  is  involved  in 
nuclear  localization  and  other  protein  interactions  (23). 
Thus,  mode  of  inhibition  analysis  indicates  that  1  competes 
for  binding  to  the  extrahelical  uracil  binding  site  observed 
in  the  uracilated-DNA  complex  shown  in  Figure  IB  and 
that  its  inhibitory  potency  is  not  affected  by  the  N-terminal 
extension  present  in  nuclear  UNG2. 

We  also  investigated  the  potency  of  1  in  cell  culture.  Since 
there  is  no  simple  marker  for  assessing  UNG2  inhibition  in 
cell  culture,  an  ex  vivo  assay  was  developed  to  assess  whether 
the  inhibitor  enters  cells  and  binds  to  UNG2.  In  this  assay, 
cells  are  treated  with  a  single  high  concentration  of  inhibitor 
(3  mM),  and  then  carefully  diluted  cell  extracts  are  prepared 
for  fluorometric  assay  of  UNG2  activity  (17).  Assuming  full 
equilibration  of  the  inhibitor  across  the  cell  membrane,  and 
taking  into  account  extract  dilutions  and  measured  cell 
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numbers  and  volumes  (19),  the  concentration  of  1  in  the  final 
UNG2  assay  was  calculated  to  be  13  JJ.M  (Figure  3).  This 
concentration  consistently  gave  rise  to  20  ±  4%  inhibition 
of  UNG2  activity  relative  to  control  extracts  prepared  ident¬ 
ically  and  in  parallel,  which  is  <70%  inhibition  expected  from 
a  competitive  inhibitor  with  a  =  6  JIM.  This  difference 
may  reflect  that  (i)  1  is  poorly  membrane  permeable,  or 
(ii)  that  1  is  not  metabolically  stable  in  the  intracellular  envi¬ 
ronment.  In  this  respect,  oximes  are  known  to  be  reduced  by 
microsomal  NADH  cytochrome  b5  reductase  (24,25). 

To  evaluate  the  structural  basis  for  inhibition,  UNG2  was 
cocrystallized  with  1  and  diffraction  data  were  collected  to 
1.3  A  resolution  and  refined  to  an  7?factor  and  7?free  of  0.19 
and  0.22,  respectively  (PDB  ID  2HXM,  Figure  6A).  In  con¬ 
trast  to  damaged  DNA  binding,  which  leads  to  a  contraction 
of  the  active  site  structure  (15)  (26),  binding  of  1  led  to  only 
minor  structural  changes  as  compared  to  uncomplexed  UNG2 
(pdb  code  1AKZ),  with  an  r.m.s.  deviation  over  221  C-a 
atoms  of  only  0.58  A.  Despite  the  differences  in  induced  fit 
binding  as  compared  to  damaged  DNA,  1  remarkably  shares 
many  of  the  binding  interactions  observed  in  the  uracilated- 
DNA  complex  (Figure  6B  and  C).  The  DNA  binding  site  of 
UNG2  is  composed  of  a  uracil  recognition  pocket  flanked  by 
a  deep  groove  which  is  predominantly  involved  in  accommo¬ 
dating  the  single  strand  of  DNA  that  contains  the  extrahelical 
uracil.  In  the  DNA  complex,  specific  hydrogen  bonding  and 
aromatic  stacking  interactions  with  the  extrahelical  uracil 
involving  Asn204,  His268,  Glnl44  and  Phel58  are  observed 
(Figure  6B).  In  addition,  UNG2  also  makes  important  interac¬ 
tions  via  neutral  and  charged  hydrogen  bonds  with  the  5'  and 
3'  phosphodiester  groups  of  the  deoxyuridine  and  the  3' 
phosphodiester  group  of  the  3'  adjacent  nucleotide  using  the 
y-hydroxyls  of  Serl69,  Ser270  (data  not  shown)  and  Ser247, 
respectively  (Figure  6B).  The  uracil  base  of  1  shares  the 
uracil  interactions  seen  with  the  uracilated-DNA  complex, 
with  the  exception  of  the  catalytically  important  short  hydro¬ 
gen  bond  between  uracil  02  and  His268  (16,27,28).  The 
planar  oxime  linkage  at  the  uracil  side  of  the  tether  is 
observed  to  extend  directly  over  the  space  that  is  occupied 
by  the  deoxyribose  ring  of  deoxyuridine  in  the  DNA  com¬ 
plex,  but  then,  the  alkane  linker  sharply  kinks  such  that  the 
oxime  linkage  connecting  to  the  benzylformate  moiety  nearly 
perfectly  superimposes  the  path  taken  by  the  sugar  phosphate 
backbone  of  the  DNA  3'  to  the  deoxyuridine  nucleotide 
(Figure  6C).  This  trajectory  of  the  linker  presents  the  car- 
boxylate  substituent  of  the  benzyl  ring  such  that  it  forms  a 
charged  tridentate  hydrogen  bond  with  the  backbone  amide 
groups  of  Ser247  and  Tyr248  and  the  y  hydroxyl  of  Ser247 
(Figure  6B).  These  interactions  with  the  carboxylate  group 
mimic  those  of  the  3'  phosphodiester  group  of  the  nucleotide 
directly  adjacent  to  deoxyuridine  in  the  DNA  complex 
(Figure  6C).  In  addition,  the  oxime  oxygen  on  the  uracil 
side  of  the  tether  accepts  a  hydrogen  bond  from  the 
y-hydroxyl  of  Seri 69  thereby  mimicking  the  interaction  of 
the  5'-phosphate  of  dUrd  in  the  DNA  complex.  Due  to  differ¬ 
ences  in  induced  fit  binding  between  1  and  uracilated-DNA, 
the  catalytic  His268  is  too  far  from  uracil  02  to  form  the 
strong  hydrogen  bond  seen  in  the  DNA  complex.  Instead, 
His268  stacks  over  the  benzyl  ring  of  1  to  form  a  3.6  A 
n-n  aromatic  interaction  (Figure  6B).  Overall,  1  shares 
three  of  the  four  hydrogen  bond  interactions  with  the  uracil 


Figure  6.  Interactions  of  1  and  damaged  DNA  with  the  active  site  of  UNG2. 
(A)  Global  structure  of  inhibitor-UNG2  complex.  (B)  Discrete  interactions  of 
1  with  the  active  site  of  UNG2.  The  2F0-FC  electron  density  map  is  shown  at  a 
contour  level  of  la.  (C)  Overlay  of  1  (gold)  with  the  region  of  the  damaged 
DNA  strand  (turquoise)  containing  uracil  and  the  adjacent  two  3'  nt  (1EMH). 

base  observed  in  the  DNA  structure  and  three  of  the  five 
DNA  backbone  hydrogen  bonds. 

This  structure  also  provides  useful  insights  into  the  inhibi¬ 
tion  provided  by  compound  2,  as  well  as  our  previously 
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characterized  tight-binding  SFT  compound  (3,  K,  =  300  nM) 
that  contains  a  2,4,5-trihydroxybenzyl  substituent  (17). 

Manual  docking  studies  suggest  that  the  longer  three  car¬ 
bon  linker  of  2  is  used  to  extend  its  m-formate  substituent 
such  that  it  can  serve  as  a  DNA  phosphate  mimic  as  observed 
for  the  p-formate  substituent  of  1.  Our  previously  reported 
tight-binding  SFT  compound  contained  a  p-hydroxyl  sub¬ 
stituent  and  also  a  linker  that  is  one  carbon  longer  than  1  sug¬ 
gesting  that  the  p-hydroxyl  is  positioned  to  form  neutral 
hydrogen  bonding  interactions  in  the  same  pocket  occupied 
by  the  p-formate  group  of  1.  Apparently,  the  20-fold  higher 
affinity  of  this  previous  SFT  ligand  arises  from  favorable  pre¬ 
sentation  of  all  of  its  hydroxyl  substituents.  It  is  interesting  to 
note  that  the  215  member  aldehyde  SFT  library  contains  only 
two  carboxylate  compounds,  and  both  of  these  were  detected 
as  inhibitors  in  HTS  when  the  correct  linker  length  was 
employed  (i.e.  compounds  1  and  2).  Thus,  binding  elements 
possessing  molecular  features  similar  to  the  DNA  substrate 
arise  more  frequently  as  inhibitors,  suggesting  that  libraries 
enriched  in  such  motifs  might  have  higher  hit  rates. 

The  SFT  approach  may  find  general  utility  in  targeting 
enzymes  that  recognize  extrahelical  bases.  The  flexible 
alkane  tether  appears  to  be  an  accommodating  scaffold  that 
allows  favorable  presentation  of  binding  elements  that  are 
complementary  to  the  DNA  binding  surface  of  the  enzyme. 
More  generally,  substrate  fragments  (or  weak  binding  lig¬ 
ands)  that  target  enzyme  active  sites  should  make  excellent 
starting  places  for  rapid  inhibitor  development  by  this  or 
other  tethering  approaches  (29). 

SUPPLEMENTARY  DATA 

Supplementary  Data  are  available  at  NAR  online. 
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